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ANALYSIS OF A VERBAL CONDITIONING SITUATION IN 
TERMS OF STATISTICAL LEARNING THEORY? 


W. K. ESTES AND J. H. STRAUGHAN 


Indiana University 


It is the purpose of this study to 
investigate the theoretical significance 
of a rather striking coincidence be- 
tween an experimental fact and a 
mathematical fact. The experimental 
fact has been established in the 
Humphreys-type “verbal condition- 
ing” situation. In this situation S is 
asked to predict on each of a series of 
trials whether some designated event, 
e.g., the flash of a light, will occur; 
this event, the analogue of the US in a 
conditioning experiment, is presented 
in accordance with a predetermined 
schedule, usually random with some 
fixed probability. Several recent in- 
vestigators (3, 5) have noted that S 
tends to match his response rate to 
the rate of occurrence of the predicted 
event so that if the probability of the 
latter is, say, .75, the mean response 
curve for a group of Ss tends over a 
series of trials toward an apparently 
stable final level at which the event is 
predicted on approximately 75% of 
the trials. This behavior has seemed 
. puzzling to most investigators since it 
does not maximize the proportion of 
successful predictions and thus does 

1 This research was facilitated by the senior 


author’s tenure as a faculty research fellow of the 
Social Science Research Council. 


not conform to conventional law of 
effect doctrine. The mathematical 
fact which will concern us appeared in 
the course of developing the formal 
consequences of statistical association 
theory (1, 2); in a simple associative 
learning situation satisfying certain 
conditions of symmetry, the theoretical 

Mptote of response probability 
turns out to be equal to the probabil- 
ity of reinforcement. The reasoning 
involved may be sketched briefly as 
follows. 

We consider a situation in which 
each trial begins with presentation of 
a signal, or CS; following the signal, 
one or the other of two reinforcing 
stimuli, E, or Es, occurs, the proba- 
bility of E, and E,.during a given 
series being and 1—z, respectively. 
The behaviors available to S are cate- 
gorized into two classes, A; and A», by 
experimental criteria. In the verbal 
conditioning situation, A; is a predic- 
tion that E, will occur, and A: a pre- 
diction that E, will occur on the given 
trial. We assume og the CS deter- 
mines a population, S of stimulus 
elements which is eatatted by S on 
each trial, the proportion @ of the 
elements in this population constitut- 
ing the effective sample on any one 
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trial. The dependence of S’s responses 
upon the stimulating situation is ex- 
pressed in the theory by defining a 
conditional relationship such that each 
element in S, is conditioned to (tends 
to evoke) either A, or Ao. In order to 
interpret the formal model in terms of 
a verbal conditioning experiment, we 
assume that when an E, occurs it 
evokes from S a response belonging to 
class Aj, i.e., one which is compatible 
with the response of predicting E; but 
which interferes with the response of 
predicting E,, and that when an E, 
occurs it evokes a response of class Ag. 
Then on a trial on which E; occurs we 
expect on the basis of association prin- 
ciples (1) that all elements sampled 
from §S, on the trial will become con- 
ditioned to A; while on an E; trial the 
sample will be conditioned to Az. 
Now if successive trials are sufficiently 
discrete so that samples from S, are 
statistically independent, the proba- 
bility of an A; after Trial n, abbrevi- 
ated p(n), is defined in the model as 
the proportion of elements in S, that 
are conditioned to A;, and similarly 
for the probability of an A», [1—p(m) J. 
With these definitions the rule for cal- 
culating the change in response prob- 
ability on an E, trial may be stated 
formally as 


p(n + 1) = (1-0)p(n) + (1) 


and on an Es trial as 


p(n +1) = (1-#)p(n). = (2) 


The genesis of these equations will be 
fairly obvious. The proportion (1-#) 
of stimulus elements is not sampled, 
and the status of elements that are not 
sampled on a trial does not change; 
the proportion @ is sampled and these 
elements are all conditioned either to 
A, or to A, accordingly as an E, or an 
E, occurs.2 Now in a random rein- 


2 Consequently the functions derived in this 
paper should be expected to apply only to learn- 


forcement situation, Equation 1 will 
be applicable on the proportion x of 


trials and Equation 2-on the propor- 


tion (l-r); then the average proba- 
bility of A, after Trial n + 1 will be 
given by the relation 


p(n + 1) = x[ (1-9) p(n) + 8) 
+ (1—-r)(1—@)p(n) (3) 
= (1 — 6) p(n) + Or. 


If a group of Ss begins an experiment 
with the value (0), then at the end 
of Trial 1 we would have 


p(1) = (1 — 6) pO) + éz, 
at the end of Trial 2 


p(2) = (1 — @)[(1 — 6) pO) 
+ On] + Ox 
=r—([r— p(0)](1 — 4)’, 


and so on for successive trials; in 
general it can be shown by induction 
that at the end of the ath trial 


p(n) =x—[w— pO) (1-46). (4) 


Since (1 — 6) must be a fraction be- 
tween zero and one, it will be seen 
that Equation 4 must be a negatively 
accelerated curve running from the 
initial value #(0) to the asymptotic 
value x. 

This outcome of the statistical learn- 
ing model is rather surprising at first 
since it makes asymptotic response 
probability depend solely upon the 
probability of reinforcement. It seems, 
however, to be in excellent agreement 
with the experimental results of Grant, 
Hake, and Hornseth (3) and Jarvik 
(5). The question that interests us 


ing situations which are symmetrical in the fol- 
lowing sense. ‘To each response class there must 
correspond a reinforcing condition which, if pres- 
ent on any trial, ensures that a response belong- 
ing to the class will terminate the trial. These 
functions should, for example, be applicable to 
learning of a simple left-right discrimination with 
correction; but not to a left-right discrimination 
without correction, to free responding in the 
Skinner box, or to Pavlovian conditioning. 
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now is whether this agreement is to be 
regarded as a remarkable coincidence 
or as a confirmation of the theory. 
We cannot estimate a confidence level 
for the latter conclusion since the ex- 
periments were not conducted specifi- 
cally to test the theory, and we cannot 
guarantee that we would be as alert to 
notice results contrary to the theory 
which might appear in the literature 
as we have been in the case of these 
decidedly positive instances. It has 
seemed to us that the least objection- 
able way out of this impasse is to 
carry out some new experiments, mak- 
ing use of one of the convenient fea- 
tures of a mathematical theory, namely, 
that if it will generate one testable 
prediction for a given experimental 
situation, it can generally be made to 
yield many more. In the experiment 
to be reported we have tried to set up 
a situation similar in essentials to that 
used by Humphreys, Grant, and others 
with an experimental design which 
would permit testing of a variety of 
consequences of the theory. Each S 
was run through two successive series 
of 120 trials in an individualized modi- 
fication of the Humphreys situation 
with the schedule of x values shown in 
Table 1. Within the first series we 
will be able to compare learning rates 
and asymptotes of groups starting 
from similar initial values but exposed 
to different probabilities of reinforce- 
ment; within the second series we will 
be able to compare groups starting at 
different initial values but exposed to 
the same probabilities of reinforce- 
ment. Comparison of Group I with 
the other groups over both series will 
permit evaluation of the stability of 
learning rate (@ value) from series to 
series when the w value does or does 
not change. Series I, and series IIp 
will provide a comparison in which 
initial response probabilities and + 
values are the same but the amount of 
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TABLE 1 


ExpeRIMENTAL Desicn tn Terms or Prosa- 
BILITY OF REINFORCEMENT (# VALUE) 
puRING Eacu Series 











Group N by = a Teighs 129-960 
I 16 30 30 
II 16 50 30 
III 16 85 30 














preceding reinforcement differs. In 
order to separate the effect of over-all 
m value from that of particular orders 
of event occurrences, each of the three 
groups indicated in Table 1 has been 
subdivided into four subgroups of four 
Ss each; within a treatment group, say 
Group I, all subgroups have the same 
mw value but each receives a separate 
randomly drawn sequence of E,’s and 
E,’s. 
METHOD 


Apparatus—The experiment was run in a 
room containing a 2-ft. square signal board and 
four booths. Upon the signal board were 
mounted 12 12-v., .25-amp. light bulbs spaced 
evenly in a circle 18 in. in diameter. The bulbs 
occupied the half-hour positions of a clock face. 
Only the top two lights on the board were used 
as signals in this experiment. The signal board 
was mounted vertically on a table 40 in. high 
and was about 5 ft. in front of Ss’ booths. 

The booths were made from two 30 X 60 in. 
tables, 30 in. high, placed end to end but meeting 
at an angle so that Ss sitting behind them would 
be facing almost directly toward the signal board, 
about 7 ft. in front of Ss’ eyes. Two Ss sat at 
each table, The four Ss were separated from 
one another by panels 2 ft. high and 32 in. wide. 
These panels were mounted vertically on the 
table tops so as to extend 14} in. beyond the 
edge of the table between the seated Ss. 

In each booth, 18 in. back from S’s edge of 
the table, was a wooden panel 12 in. high 
mounted vertically on the table top and extend- 
ing across the width of the booth. On the side 
of this panel facing S were two reinforcing lights 
of the same size as those on the signal board but 
covered by white, translucent lenses. These 
lights were directly in front of S, 4 in. apart and 
8 in. above the table top. On the table below 
each reinforcing light was a telegraph key. 

The orders of presentation and the durations 
of the signal lights and reinforcing lights were 
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controlled by a modified Esterline-Angus re- 
corder using a punched tape and a system of 
electrical pick-up brushes. The recorder was 
placed on the table behind the signal board. 
Recorder pens which were activated by depres- 
sion of the telegraph keys in Ss’ booths were 
mounted between the brushes. Thus, the pre- 
sentations of the lights and Ss’ responses were 
recorded on the same tape. A panel light was 
mounted above the Esterline-Angus recorder so 
that E, seated behind the signal board, could 
watch the operation of brushes and pens during 
the experiment. 

Windows in the experimental room were cov- 
ered with opaque material and the experiment 
was run in darkness except for light that came 
from the apparatus. 

Subjects—The Ss were 48 students obtained 
from beginning lecture courses in psychology 
during the fall semester of 1952 and assigned at 
random to experimental groups. 

Procedure.—At the beginning of a session, Ss 
were brought into the room, asked to be seated, 
and read the following instructions: 

“Be sure you are seated comfortably; it will 
be necessary to keep one hand resting lightly 
beside each of the telegraph keys throughout the 
experiment and to watch both the large board in 
the front of the room and the two small lights in 
your own compartment. Your task in this ex- 
periment will be to outguess the experimenter on 
each trial, or at least as often as you can. The 
ready signal on each trial will be a flash from the 
two top lights on the big board. About a second 
later either the left or the right lamp in your 
compartment will light for a moment. As soon 
as the ready signal flashes you are to guess 
whether the left or the right lamp will light on 
that trial and indicate your choice by pressing 
the proper key. If you expect the left lamp to 
light, press the left key; if you expect the right 
lamp to light, press the right key; if you are not 
sure, guess. Be sure to make your choice as soon 
as the ready signal appears, press the proper key 
down firmly, then release the key before the 
ready signal goes off. It is important that you 
press either the left or the right key, never both, 
on each trial, and that you make your decision 
and indicate your choice while the signal light 
is on. 

“Now we will give you four practice trials.” 

At this point the overhead lights were extin- 
guished and the recorder started. If any obvi- 
ous mistakes were made by S during the four 
practice trials, they were pointed out by E£. 
During the four practice trials the reinforcing 
lights were always given in the order: E;, Ej, 
E2, Ez. After the practice trials the following 
instructions were read: : 


“Are you sure you understand all of the in- 
structions so far? The rest of the trials will 
have to be run off without any conversation or 
other interruptions. Please make a choice on 
every trial even if it seems difficult. Make a 
guess on the first trial, then try to improve your 
guesses as you go along and make as many cor- 
rect choices as possible.” 

Questions were answered by rereading or 
paraphrasing the appropriate part of the instruc- 
tions. If there were any questions about tricks 
the following additional paragraph was read. 

“We have told you everything that will 
happen. There are no tricks or catches in this 
experiment. We simply want to see how well 
you can profit from experience in a rather diffi- 
cult problem-solving situation while working 
under time pressure.” 

The recorder was now started again and the 
240 experimental trials were run off in a con- 
tinuous sequence with no break or other indica- 
tion to S$ at the transition from Series A to 
Series B. On each trial, the signal lamps were 
lighted for approximately 2 sec.; 1 sec. later the 
appropriate reinforcing light in each S’s booth 
lighted for .8 sec.; then after an interval of .4 
sec. the next ready signal appeared; and so on. 
The high rate of stimulus presentation was used 
in order to minimize verbalization on the part of 
Ss. 


REsuULTs AND Discussion 


Terminal response probabilities. —It 
will be clear from our discussion of 
Equation 4 that the predicted asymp- 
tote for each series will be the value 
of w obtaining during the series. We 
have taken the mean proportion of A; 
responses during the last 40 trials of 
each series as an estimate of terminal 
response probability, and these values 
are summarized for all groups and 
both series in Table 2. 


TABLE 2 


TerminaL Mean Response ProsasBiLiries 
For Eacu Series 








Series A Series B 
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For the first series a simple analysis 
of variance yields an F significant 
beyond the .001 level for differences 
among means. From the _ within- 
groups variance estimate we obtain a 
value for the standard error of a group 
mean, and this is used in the ¢ test 
between each group mean and the 
appropriate theoretical mean. For 
the second series the between-groups 
F has a probability between the .05 
and .10 levels. In neither series were 
differences among subgroup means 
significant at the .05 level. 

The interpretation seems straight- 
forward. Group III approximates 
the theoretical asymptote in both 
series. Group I falls significantly 
short of the theoretical asymptote in 
the first series but approximates it in 
the second series. Group II falls sig- 
nificantly short of the theoretical 
asymptote in the second series, but 
reaches the same probability level as 
had Group I in the first series. Of the 
t tests computed for differences be- 
tween the last two blocks of 20 trials 
in each series, all yielded probabilities 
greater than .10 except the? for Series 
IIp which was significant at the .02 
level. Evidently the predictions con- 
cerning mean asymptotic values are 
correct, but the rate of approach to 
asymptote is faster with Group III 
than under the other conditions. 

According to theory, not only group 
means, but also individual curves 
should approach # asymptotically. 
To obtain evidence as to the tenability 
of this aspect of the theory ‘we have 
examined the distributions of indi- 
vidual A; response proportions for the 
last 40 trials of Series III,4, III, and 
Ip. If all individual p values approxi- 
mate the theoretical asymptotes over 
these trials, then for each of the series 
the individual response proportions 
should cluster around the mean value, 
7, with an approximately binomial 
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resenting mean proportion of E; predictions (A; 
responses) per 20-trial block for each series 


distribution. Taking the theoretical ¢ 
equal to ¥40x(1 — x), which is actu- 
ally a slight underestimate of the true 
value, we find that approximately half 
of the scores in each serjes fall within 
one a of the theoretical asymptote and 
only one score in each series deviates 
by more than three o. It appears, 
then, that except for a few widely devi- 
ant cases the p values of individual Ss 
approach the theoretical asymptote. 

One might raise a question as to just 
what is meant by the asymptote of an 
empirical curve in a situation of this 
kind. Naturally one would not expect 
the Ss to perform at constant rates 
indefinitely. It does not seem that 
any sort of breaking point was ap- 
proached in the present study, how- 
ever; one subgroup of Group I was 
run for an additional 60 trials beyond 
Trial 240 and maintained an average 
proportion of .304 A; responses over 
these trials. 

Mean learning curves—In Fig. 1 
mean data are plotted in terms of the 
proportion of A; responses per block of 





230 W. K. ESTES AND J. H. STRAUGHAN 


20 trials. The theoretical function 
which should describe these empirical 
curves is readily obtained from Equa- 
tion 4. Letting m be the ordinal num- 
ber of a block of 20 trials running from 
Trial n + 1 to Trial n + 20 inclusive, 
and P(m) the expected proportion of 
A, responses in the block, we can write 





P(m) =r 
_ Er — pO] — 9% 
20 6 
x [1 — (1 — 0)” (5) 


this expression being simply the mean 
value of p(n) over the mth block of 20 
trials. According to theory, Equa- 
tion 5 should describe each of the 
mean curves of Fig. 1 once numerical 
values are substituted for the param- 
eters 1, (0), and @; furthermore, the 
value of @ required should not differ 
among groups within either series and 
should be constant from series to series 
for each group. The values of w are 
of course fixed by the experimental 
procedure. The values of p(0) in the 
first series should be in the neighbor- 
hood of .50, but for groups of size 16 
sampling deviations could be quite 
large so it will be best to get rid of 
p(0) in favor of P(1) which can be 
measured more accurately. To do 
this we write Equation 5 for m = 1 


OO) TE 501 
x — p(0 - 
re ee ta 
then solve for [x — (0) ] 


206 [x — P(1 





and substitute this result into Equa- 
tion 5 giving 


P(m) =r 


~ {se — POI -— gre.’ © 


Observed values of P(1) turn out to 
be .58 and .59 for Series I, and IIIa, 


respectively. Now we lack only em- 
pirical estimates of @ and these can be 
obtained by a simple statistical pro- 
cedure. The method we have used is 
to sum Equation 6 over all values of 
m, obtaining for K blocks of trials 


K 

Sr —[e— PU] — em 

en ln 
i-(G—-e» 





then equate Equation 7 to the sum of 
the observed proportions for a given 


series and solve for @. For Group I 


we obtain the estimate @ = .018 and 
for III,,@ = .08. Using these param- 
eter values we have computed the 
theoretical curves for Group I and for 
the first series of Group III, which 
may be seen in Fig. 1. In this anal- 
ysis we find agreement between data 
and theory in one respect but not in 
another. The theoretical curves pro- 
vide reasonably good descriptions of 
the observed points, especially in the 
case of Group I, but the 6 values for 
the two groups are by no means equal. 
The latter finding does not come as a 
surprise inasmuch as we had found in 
the previous section that Group I was 
significantly short of its theoretical 
asymptote in the first series, while 
Group IH was not. } 
We did not try to estimate a @ value 
for the first series of Group II since the 
empirical curve is virtually horizontal 
and closely approximates the line 
P(m) = x = .50. We could proceed 
to estimate @ values for Series IIs 
and IIIg by the method used above, 
but it will be of more interest to con- 
struct predicted curves for these series 
without using any additional informa- 
tion from the data. According to the 
theory, it should be possible to com- 
pute those curves from information 
already at our disposal. The (0) 
values in the second series should be 








PREDICTIONS 








VERBAL CONDITIONING 


TABLE 3 


PREDICTED AND OssERVED MEAN FREQUENCIES 
OF THE A; RESPONSE IN THE 
Seconp SERIES 














Group Observed Predicted t 
I 37.19 37.74 0.22 
II 46.00 45.94 0.02 
Ill 42.75 42.86 0.04 
F 3.16 (p>.05) 














the theoretical asymptotes of the first 
series, or .50 and .85 for Groups II 
and III, respectively. The only pro- 
cedural difference between I, and IIs 
lies in the number of preceding rein- 
forcements; according to the statis- 
tical model, however, this variable 


120- 


231 


will be expected to have no effect 
except insofar as it leads to a change in 
p(0), so except for sampling error the 
6 value estimated for Group I should 
be applicable to IIp. Using .50, .30, 
and .018 as the values of j(0), w, and 
6, respectively, we have computed a 
theoretical curve for Series IIp, and 
this is plotted in Fig. 1. Similarly, the 
6 value estimated for Series III, should 
apply also to IIIp, and we have used 
this value, .08, together with .30 for x 
and .85 for #(0) to compute the pre- 
dicted curve for III, shown in Fig. 1. 
Considering that no degrees of free- 
dom in the Series B data have been 
utilized in curve fitting, the corre- 
spondence between the theoretical and 








PREDICTIONS 





40- 

















© 80 160 2400 80 160 2400 80 160 2400 80 160 240 


TRIALS 


Fic. 2. Empirical and theoretical cumulative response curves for individual Ss of Group I 
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empirical curves does not seem bad. 
The reason for some of the irregulari- 
ties will be brought out in the next 
section. A statistical test of one 
aspect of the correspondence can be 
obtained by calculating for each theo- 
retical curve a predicted mean total 
of A; responses in the second series, 
by means of Equation 5, and com- 
paring these values with the observed 
mean totals. This has been done and 
the comparison is given in Table 3. 
The ¢ values for differences between 
observed and theoretical values seem 
satisfactorily low. 

In order to give an idea of the extent 
to which the behavior of individual Ss 
conforms to the theoretical function, 
we have plotted in Fig. 2 the indi- 
vidual cumulative response curves for 
all Ss of Group I. The cumulative 
form was chosen for the smoothing 
effect, some of the noncumulative 
curves being too irregular for curve- 
fitting purposes. The theoretical 
curves in Fig. 2 represent Equation 7 
with @ values obtained by a method of 
approximation. Ten of the curves are 
fitted quite well by this function with 
x = .30 as the asymptote parameter. 
Four curves, Numbers 2, 11, 15, 16, 
require other values for this param- 
eter, viz., .075, .45, .24, and .18, respec- 
tively. Curves 3 and 4 deviate con- 
siderably from the theoretical form. 
In general, it appears that the empir- 
ical curves for most individual Ss can 
be described quite satisfactorily by 
the theoretical function, and this fact 
gives us some basis for inferring that 
in this situation mean learning curves 
for groups of Ss reflect the trend of 
individual learning uncomplicated by 
any gross artifacts of averaging. 

The effect of 120 reinforcements at a 
mw value of .50 may be evaluated by 
comparing curve forms and mean A; 
response totals for Series I, and IIs. 
We find that the reinforcements lead 


to no increase in resistance to change. 
Slopes of the two curves are very 
similar and the response totals do not 
differ significantly. This result is in 
line with predictions from the statis- 
tical model, but a little surprising, 
perhaps, from the viewpoint of Thorn- 
dikian or Hullian reinforcement the- 
ory since partial reinforcement has 
generally (6) been held to increase 
resistance to extinction in this 
situation. 

The conclusions from our study of 
the mean learning curves would seem 
to be (a) that under some circum- 
stances at least it is possible to evalu- 
ate theoretical parameters from the 
data of one series of learning trials and 
then to predict the course of learning 
in a new series; and (bd) that the rate 
at which the mean learning curve 
approaches its asymptote depends, in 


an as yet incompletely specified man-, 


ner, upon the difference between initial 
response probability and the proba- 
bility of reinforcement obtaining dur- 
ing the series. 

Sequence effects —The mean curves 
studied in the preceding section may 
not reflect adequately all of the learn- 
ing that went on during the experi- 
ment. The irregularities in some of 
the mean curves of Fig. 1 might be 
accounted for if there is a significant 
tendency for Ss’ response sequences to 
follow the vagaries of the sequences of 
E,’s and E,’s. To check on this possi- 
bility we have plotted in Fig. 3 the 
mean proportions of A; responses vs. 
frequencies of E, occurrences per 10- 
trial block for all groups in Series B. 
In preparing this graph, the 120 trials 
of Series B were divided into 12 suc- 
cessive blocks of 10. Since there were 
48 Ss, there were 576 of these trial 
blocks and they were classified accord- 
ing to the number of E, occurrences 
in a block. Then for the set of all 
blocks in which no E;’s occurred, the 
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mean proportion of A; responses was 
computed and entered as the first 
point in Fig. 3, and so on for the 
remaining points. It seems clear that 
Ss were responding to the particular 
sequences of E,’s and E,’s, and not 
simply to the over-all rate. Corre- 
sponding graphs for the three groups 
in the first series had somewhat shal- 
lower slopes; they have not been 
reported since some of the individual 
points were based on too few cases to 
be reliable and the groups could not 
be averaged together in the first series 
owing to the different + values. 

In order to deal statistically with 
this apparent dependence of response 
tendency upon the density of E, occur- 
rences in the immediately preceding 
sequence, we have computed for each 
series the average probability, pa, jz,, 
that an A, occurs on Trial n given that 
an E; occurs on Trial » — 1 and the 
average probability, fa,jz,, that an 
A; occurs on Trial n given that an E, 
occurs on Trial »—1. The differ- 
ence between these two quantities can 
be shown to be proportional to the 
point correlation (7) between A(n) 
and E(m—1) for a given series. 
Furthermore our Equations 1 and 2 
may be regarded as theoretical expres- 
sions for the two conditional proba- 
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TABLE 4 


Mean DirFeRENCES BETWEEN OBSERVED 
VALUES OF Pa,E, AND Paik, 
ror Eacu Series 











Series Group I Group II Group III 
A 128 199 153 
B .214 294 .231 














bilities, pa,jz, and pa,jz,, respectively, 
and it will be seen that if these expres- 
sions are averaged over all values of n 
in a series and the second subtracted 
from the first, the difference is equal 
to the parameter 9, i.e., 


(1 — 6)p(n) + 0 — (1 — 0)p(n) = 8. 


Thus from the statistical model we 
must predict that the difference be- 
tween empirical estimates of these 
conditional probabilities for any series 
should be positive and, if successive 
trials are independent, this difference 
should be equal to the value of @ esti- 
mated from the mean response curve. 
The conditional probabilities have 
been computed from the data for each 
S and mean differences by groups are 
summarized in Table 4. 

All of the differences are positive 
and significant at better than the .001 
level of confidence. The differences 
among group means are insignificant 
for both series (F’s equal to .45 and 
.73, respectively) as are differences 
among subgroup means. The increases 
from the first series to the second are, 
however, significant beyond the .005 
level. The latter effect was not antici- 
pated on theoretical grounds; the 
most plausible explanation that has 
occurred to us is that alternation 
tendencies associated with previously 
established guessing habits extin- 
guished during the early part of the 
experiment. This hypothesis would 
also account for the high P(1) value 
observed for Group I in Fig. 1. 

Although all of the quantities in 
Table 4 are positive and apparently 
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independent of , as required by the 
theory, the numerical values are all 
larger than the @ estimates obtained 
from mean response curves. The most 
straightforward interpretation of this 
disparity would be that, owing to the 
short intertrial interval, successive 
trials are not independent in the sense 
required by the theoretical model. 
Nonindependence would have at least 
two immediate consequences in so far 
as the present experiment is concerned. 
First, stimulus samples drawn on suc- 
cessive trials would overlap, and the 
learning that occurred on one trial 
would affect behavior on the next to a 
greater extent than random sampling 
would allow for, thus increasing pa,z,, 
and decreasing fa,z,. Second, the re- 
inforcing stimulus of one trial, E; or 
E2, would be part of the stimulus com- 
plex effective at the beginning of the 
next trial. If this interpretation is 
correct, then more widely spaced trials 
should result in better agreement be- 
tween the alternative estimates of @ 
and also in reduction of the depend- 
ence of mean learning rate upon prob- 
ability of reinforcement. 


SUMMARY 


Learning rates, asymptotic behavior, and 
sequential properties of response in a verbal con- 
ditioning situation were studied in relation to 
predictions from statistical learning theory. 

Forty-eight college students were run in an 
individualized modification of the “verbal condi- 
tioning” experiment originated by Humphreys 
(4). Each trial consisted in presentation of a 
signal followed by a left-hand or right-hand 
“reinforcing” light; S operated an appropriate 
key to indicate his prediction as to which light 
would appear on each trial. For each S one of 
the lights, selected randomly, was designated as 
E,, the other as Ey. On the first series of 120 
trials, E, occurred with probability .30, .50, and 
.85 for Groups I, II, and III, respectively. On 
the second 120 trials, E,; occurred with proba- 
bility .30 for all groups. 

Theoretical predictions were that mean proba- 
bility of predicting E; should tend asymptoti- 
cally to the actual probability of E:, both during 
original learning and following a shift in proba- 


bility of reinforcement; and that response 
probabilities should change in accordance with 
exponential functions, learning rates (as meas- 
ured by slope parameters) being independent of 
both initial condition and probability of rein- 
forcement. 

The statistical criterion for approach to theo- 
retical asymptote was met by Group I by the 
erid of the second series and by Group III in 
both first and second series. In the second 
series, Group II was short of theoretical asymp- 
tote but reached the same response probability 
as had Group I during the first series. 

Learning rates were virtually identical for 
Group I, first series, and Group II, second series, 
indicating that resistance of response probability 
to change is not altered by 50% random rein- 
forcement in this situation. Learning rates dif- 
fered significantly among groups within both 
series. In general, learning rate was directly 
related to difference between initial response 
probability and probability of reinforcement 
during a series. It was suggested that this rela- 
tionship may depend upon temporal massing of 
trials. Not only group means, but individual 
learning curves could be described satisfactorily 
by theoretical functions. 

No tendency was observed for Ss to respond 
to a series as a whole. On the contrary, sensi- 
tivity to effects of individual reinforcements and 
nonreinforcements (E; and Eg occurrences) in- 
creased significantly as a function of trials. 
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EFFECT OF NUMBER OF CHOICES PER UNIT OF A 
VERBAL MAZE ON LEARNING AND 
SERIAL POSITION ERRORS! 

W. J. BROGDEN AND ROBERT E. SCHMIDT 


University of Wisconsin 


The experiment to be reported in 
this paper was designed as an investi- 
gation on the acquisition of spatial 
orientation. The Ss were asked to 
assume movement within a spatial 
maze as they responded with words 
that represent movement in space. 
Mazes of 16 units in length, with 2, 3, 
4, 6, or 8 alternate choices per unit 
were used. The mazes will differ in 
spatial complexity only if S is able to 
assume movement in space as he 
makes the required responses. Since 
the Ss failed in achieving this attitude, 
the task becomes one of nonspatial 
serial verbal learning. 

The experiment, then, provides for 
a determination of the relation be- 
tween acquisition of a verbal maze and 
the number of alternate choices per 
unit. This variable is representative 
of the dimension of amount of discov- 
ery as defined by Melton (3). With 
other conditions equal, difficulty of 
acquisition should increase with in- 
crease in amount of discovery required. 
Riley (4) has demonstrated this to be 
so for the acquisition of a list of eight 
paired associates where the correct 
response was discovered among 1, 2, 
or4alternatives. The present experi- 
ment extends the range of amount of 
discovery and provides information 
on the relation of this variable to 
difficulty of acquisition with param- 
eters of learning different from those 
present in the study by Riley. 


1 This research was supported in part by a 
grant from the University Research Committee 
from funds provided by the Wisconsin Alumni 
Research Foundation. 


235 


PROCEDURE 


Twenty Ss in each of five groups practiced a 
16-unit verbal maze until the criterial perform- 
ance of one perfect repetition was attained. 
The number of choice points per unit was 2, 3, 
4, 6, and 8, respectively, for the five groups. 
The Ss were members of the classes in elementary 
psychology at the University of Wisconsin who 
volunteered to serve in an experiment on learning 
a mental maze. Of the total 100 Ss, 76 were 
women and 24 were men. The N of 20 for each 
of the five groups was filled simultaneously by 
random assignment of Ss to each group or sub- 
group. 

Conditions.—Each unit of Maze 1, the two- 
choice maze learned by Group I, involved a 
choice of right (R) or left (L). For Maze 2, the 
three choices were R, L, and up (U), or R, L, and 
down (D). Half of the Ss in Group II (Sub- 
group IIa) was given the choices R, L, and U, 
and the other half (Subgroup IIb) was given the 
choices R, L, and D. The pattern for Maze 2 
was identical in these two cases except for the 
direct interchange of the choices, U and D. In 
Maze 3, the Ss of Group III had the four choices 
of R, L, U, and D for each unit. Group IV was 
divided into two equal subgroups, of which IVa 
had the choices R, L, U, D, right-up (RU), and 
left-up (LU); Subgroup IVb was given the 
choices R, L, U, D, right-down (RD), and left- 
down (LD). The pattern of Maze 4 was iden- 
tical for these two subgroups except for the direct 
interchange of the choices RU and RD with the 
choices LU and LD, respectively. The eight 
choices per unit in Maze 5 were R, L, U, D, RU, 
LU, RD, LD. Table 1 presents the pattern of 
correct choice for each of the mazes. The 
sequences of correct choice were obtained by use 
of a table of random numbers with the restric- 
tions that the first choice be R and that there be 
an equal number of each kind of choice. Since 
the latter condition could not be met completely 
for Maze 2 (three-choice) and Maze 4 (six- 
choice), equality for kind of choice was held to 
five for Maze 2 and two for Maze 4; the correct 
choices for the remaining units were filled at 
random with the requirement that a given kind 
of choice not appear more than once. 

Instructions.—Models of one unit of each of 
the five mazes were constructed from Tinker 
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TABLE 1 
Maze Patrerns 
Units of Maze 

Group Maze is wee 

Number | Choices 
1}/2/3)4]5/]6/]7] 8 | 9 | 10 | 11 | 12] 13 | 14] 15 | 16 
I 1 2 RILj|LIRIR{IRILIR{IRIL IL ILIR IL {RIL 
IIa 2a 3 RiU/IL |L{IR|IRIU/IJU|{|LIR IL JU/IL IL [RIU 
IIb 2b 3 R/ID|ILILIR|IRIDID{IL IR IL |DIL IL {RID 
III 3 + RID; RIDILJU;/JRIR {IL ID IU JLIL |U |UD 
IVa 4a 6 R}L|RUR|U/|D/D{| RU) RUD |LU;|L/IL |R |U|LU 
IVb 4b 6 R}L|RD/R|U/|D|D|RD|/ RDID |LD/L/|L |R |UJsLD 
Vv 5 8 R/}U|L |U|D/D|R]{RD/} RD|LU|LU|L|LD|LD| RU RU 



























































Toy parts. A shaft inserted in the center hole 
of a circular hub represented one unit of distance 
forward. The hub represented the choice point, 
and shafts inserted around the circumference of 
the hub represented the alternative choices. The 
angular location of shafts and their designation 
in terms of verbal response were as follows: 
0° =U; 45°=RU; 90°=R; 135° =RD; 
180° = D; 225° = LD; 270° = L; 315° = LU. 
Cardboard tabs with the appropriate words 
printed on them were inserted at the end of each 
of the shafts attached to the hub circumference. 
The E used the models in giving instructions to 
S. 
The S and E sat in chairs located at a table 
in the experimental room with S always seated 
to the right of E. The instructions were read by 
E, who also used the appropriate model of the 
maze unit and pointed to its parts when perti- 
nent to the instructions. The instructions dif- 
fered for the five maze patterns only in terms of 
the number and kinds of choices. They asked S 
to assume presence at a given point with the 
intention of proceeding in a forward direction, 
as rapidly as possible, to some other point. 
Mention was made of the possible choices and 
the model of one unit was demonstrated to show 
forward movement along the horizontal shaft to 
the hub or choice point. The alternate choices 
were again named and their spatial locations indi- 
cated on the model. Behavior for both correct 
and incorrect responses was demonstrated. If 
the E said “forward” following a response, S was 
to enter the next unit and make response from 
the alternate choices. The end of the maze or 
trial was indicated by the word “stop” spoken 
by EZ. A blindfold, composed of a pair of goggles 
behind each lens of which was placed sterile, 
absorbent cotton, was adjusted over the eyes. 
At this point, the instructions were presented 
again, but the model was not used. Any ques- 
tions relating to clarification of the instructions 
were’answered and the first trial was then started. 
Measures —The E used two stop: watches, 
one to measure the time consumed by each trial, 


and the other to maintain a 30-sec. interval be- 
tween each trial. Data sheets were prepared for 
each maze pattern with the letter symbols of the 
true pathway printed thereon. The E recorded 
each error by letter symbol and in terms of maze 
locus and sequence of error. 

Terminal questions—When S had completed 
the criterial trial, the blindfold was removed, and 
he was asked to answer the following questions: 
(a) How many units were there in the maze? 
(b) How many times was each response used in 
the maze pattern? (c) Can you now give the 
correct maze pattern? (d) Were you able to 
visualize the correct maze pathway as you were 
learning the maze? The S was cautioned not to 
attempt answer of a and b by a count in review 
of the true pathway. ‘The answers to these ques- 
tions were recorded by E. 


RESULTS 


The standard measures of maze 
learning, trials to the criterion, total 
errors to the criterion, and total time 
to the criterion are available as meas- 
ures of acquisition. The number of 
trials to the criterion, the total errors 
to the criterion, and the total time in 
seconds to the criterion were tabulated 
for each S in each of the five groups. 
An initial examination of these data 
was made for Subgroups IIa and IIb 
and for Subgroups IVa andIVb. The 
mazes for the a and b subgroups dif- 
fered from each other only in terms 
of one or two response words, and not 
in terms of pattern or in terms of 
number of alternate choices per unit. 
Since no significant differences were 
found, the data for the a and b sub- 
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unit verbal maze and number of alternate 
choices per unit 


groups were pooled and are presented 
only for the over-all groups, II and 
IV. An analysis of variance was per- 
formed on the data for each of the 
three measures. The F ratio for the 
between- and within-groups variances 
is 2.68 for trials to the criterion, 33.33 
for total errors to the criterion, and 
22.13 for the total time to the crite- 
rion. The value of F at the 5% level 
of confidence is 2.52. The group 
means for trials to the criterion are 
10.7, 13.3, 12.8, 13.5, and 9.8, respec- 
tively, for the five groups. No syste- 
matic relation exists between trials to 
the criterion and number of alternate 
choices per choice point. The group 
means for total errors and for total 
time, on the other hand, show a pro- 
gressive increase as the number of 
alternate choices per unit increases. 
The relations between the means for 
errors and time to number of alternate 
choices per choice point were tested 


for deviation from linear regression 
(6). In neither case was there signifi- 
cant deviation from linear regression. 
Therefore, regression equations of the 
straight line function, y = mx + }, 
were fitted by the method of least 
squares to the empirical data. These 
equations are: y = 115.4x + 157.4 for 
time and y = 34.1x — 12.8 for errors. 
Figure | presents plots of these curves 
and of the means for time and errors. 

The present experiment also pro- 
vides information about the relation 
between the elimination of errors as a 
function of serial position within the 
maze and the number of alternate 
choices per maze unit. Total errors 
to the criterion were tabulated for 
each S for each of the 16 units in the 
maze and ari analysis of variance was 
performed on these data. The F for 
groups was 216.82, for units of the 
maze was 34.02, and for the interac- 
tion of groups by units was 6.41. 
Each of these F ratios is significant at 
better than the 5% level of confidence. 
The significant F for groups indicates 
that more errors are made by Ss as 
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the number of choice points per maze 
unit increases. The one for units of 
the maze indicates that frequency of 
error varies with serial position of 
maze unit. The significant interac- 
tion term shows that the serial posi- 
tion error curve differs as a function 
of the number of alternate choices per 
maze unit. These results are shown 
in Fig. 2 where group mean frequency 
of total error is plotted as a function 
of unit position within the maze for 
each group. Each of the five curves 
was smoothed by taking a rolling 
average of “threes.” The extent of 
bowing of the serial position curve in- 
creases as the number of alternate 
choices per unit increases. The maxi- 
mum frequency of error occurs beyond 
the middle of the maze for all groups 
except Group V. The curve for this 
group is bimodal with a maximum fre- 
quency of error at Unit 7, but with a 
secondary peak at Unit 5. Another 
feature of these curves is their asym- 
metry with respect to the initial and 
terminal units of the maze. For all 
groups there is a greater frequency of 
error at the end of the maze than at 
the beginning of the maze. The mag- 
nitude of this difference appears to 
increase as the number of alternate 
choice points per unit increases. 

The questions asked of the Ss at the 
close of the experiment brought the 
following results. To the question, 
“How many units were there in the 
maze?” 29% of all Ss gave the correct 
answer; 52% underestimated; and 
19% overestimated. There was little 
difference in the distribution of esti- 
mates between the five groups. To 
the question, “How many times was 
each response used in the maze?” the 
frequency of correct response in- 
creased as the number of alternate 
choices per unit increased, going from 
20% for the 2-choice group to 90% 
for the 8-choice group. To the ques- 
tion, “Can you now give the correct 


maze pattern?” 70% of the Ss in the 
2- and 4-choice groups, and 75% in 
the 8-choice group were able to give 
the correct maze pattern. For the 
3- and 5-choice groups, only 40% and 
30% of the Ss, respectively, were able 
to give the correct maze patterns. To 
the final question, “Were you able to 
visualize the correct maze pathway as 
you were learning the maze?” only 
18% of the Ss replied positively. 


Discussion 


The failure of trials to the criterion to 
have any systematic relation with num- 
ber of alternate choices is probably due 
to the differences that occur between 
trials on mazes for which there are differ- 
ent numbers of alternate choices per unit. 
Learning of spatial mazes requires the 
use of the correction method. The S§ 
must continue responding at each choice 
point until he makes a correct response. 
Only then is he permitted to proceed to 
the next choice point. Thus, trials on 
mazes with different numbers of alternate 
choices are not comparable. If the non- 
correction method were used, trials would 
be an appropriate measure of acquisition 
where number of alternate choices is the 
independent variable. In the study by 
Riley (4), the noncorrection method was 
used and trials to the criterion show a 
progressive increase from 1 to 2 to 4 
alternate choices. 

In the present study, the data for total 
errors and total time to the criterion show 
a progressive increase with increase in 
the number of alternate choices per maze 
unit. The two functions are essentially 
the same. What may be said about the 
one, may also be said of the other. With 
the 2-choice maze as the base, diffi- 
culty of acquisition becomes dispropor- 
tionately less difficult with increase in the 
number of alternate choices. Increase in 
the number of alternate choices per 
choice point is perhaps comparable to 
increase in the length of list, quantity of 
material, or length of a maze. Studies 
of this kind of variable, however, show a 
disproportionate increase in difficulty of 
acquisition with its increase. This result 
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has been obtained with verbal learning 
where length of list or quantity of mate- 
rial is increased (1), with learning of 
finger mazes where length of the maze is 
increased (5), and with learning the 
Peterson rational learning problem when 
the number of letters is increased (2). 

A similar relation exists between num- 
ber of alternate choices per maze unit 
and number of errors expected by chance 
on the first trial. This number is 8 for 
the 2-choice maze, 16 for the 3-choice 
maze, 24 for the 4-choice maze, 40 for 
the 6-choice maze, and 56 for the 8- 
choice maze. The equation for this 
straight line function is y = 8x — 8. The 
actual group means for errors on the first 
trial are 7.00, 15.25, 22.55, 39.25, and 
63.60 for the 2-, 3-, 4, 6-, and 8-choice 
mazes, respectively. The least-square 
regression equation for these data is 
y = 9.3x — 131. One assumption made 
in determining the number of errors ex- 
pected by chance on the first trial is that 
no error will be repeated for any given 
choice point. This is a reasonable as- 
sumption since S§ was informed of the 
number and kind of responses he was to 
make before practice on the maze was 
started. Examination of the data reveals 
no repetition errors for Ss who learned 
the 2-, 3-, or 4-choice mazes. Two repet- 
itive errors were made in the 6-choice 
maze and 95 repetitions were made on the 
8-choice maze. If the repetitive errors 
are subtracted from the total errors on 
the first trial for each S§ in the 6- and 8- 
choice maze groups, the means are 39.15 
and 58.85, respectively. The regression 
equation for the adjusted data is then 
y = 8.6x — 10.9. The adjusted data are 
in close agreement with chance expect- 
ancy of errors on the first trial. The 
difference between these functions and 
that for all of the data represents a differ- 
ence in relative maze difficulty that, on 
the one hand, shows a disproportionate 
increase in difficulty as number of alter- 
nate choices increases and, on the other, 
shows an increase in difficulty as number 
of alternate choices increases that de- 
creases disproportionately. Since the 
former function is included in the latter, 
it follows that relative difficulty between 
the mazes decreases as practice progresses. 


Amount of discovery makes its greatest 
contribution to performance on the first 
trial; thereafter, its effect declines with 
further practice. 

There is a characteristic of the inde- 
pendent variable other than number of 
alternate choices per choice point that 
may be important in producing the de- 
crease in relative difficulty of the mazes 
as learning progresses. When maze 
length is held constant as the number of 
alternate choices per choice point is in- 
creased, the number of like responses in 
the correct maze pattern must decrease. 
For the 2-choice maze there are 8 R and 
8 L responses in the true pathway; for 
the 3-choice maze, the correct path con- 
tains 5+ of each of the three responses; 
for the 4-choice maze, the number of like 
responses is 4; for the 6-choice maze, it is 
2.66; and for the 8-choice maze it is 2. 
A decrease in the number of like responses 
in the correct pathway may lessen diffi- 
culty in learning through an increase of 
discrimination between the 16 units of 
the maze and a decrease in interference 
effects between the units. If this factor 
exists, it can affect learning only after 
some degree of acquisition of the maze 
pattern has occurred. Its effect pre- 
sumably would be greatest in the later 
stages of acquisition. Upon this theo- 
retical basis, therefore, the primary effect 
of number of alternate choices per choice 
point is assigned to the early stages of 
acquisition, the effect of number of like 
responses in the correct pathway is as- 
signed to the later stages of acquisition, 
and the disproportionate decrease in in- 
crements of maze difficulty of the total 
time and total error functions is attrib- 
uted primarily to the effect of the number 
of like responses in the correct maze path. 

Number of like responses in the correct 
pathway is an aspect of maze pattern 
that may have a considerable influence 
upon the relation of acquisition to num- 
ber of alternate choices per choice point. 
Although number of like responses is 
inversely related to number of alternate 
choices in the present experiment, it is 
possible to make them independent. 
Experimental variation of number of like 
responses could then be applied to all 
possible choice responses of the maze and 
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to possible choice responses in successive 
units of the maze or blocks of units, as 
well as to the correct responses. Consid- 
eration of these dimensions would also 
require that consideration be given to the 
sequence of the correct responses. In 
the present experiment, maze difficulty 
is in part a function of the particular 
sequence of correct responses selected at 
random. Manipulation of each of these 
variables might be expected to affect 
acquisition and thereby alter relative 
maze difficulty and the functional rela- 
tion of acquisition to number of alternate 
choices per choice point. 

Note should be made also of the pos- 
sible effect of further increase in the mag- 
nitude of the independent variable upon 
relative difficulty of maze acquisition. 
From a theoretical point of view, there is 
an upper limit to the amount of material, 
the length of a maze, or the number of 
alternate choices per choice point of a 
maze that can be learned as a unit of 
serially presented material by the whole 
method. With the kind of verbal maze 
used in the present experiment, it would 
be necessary to increase the number of 
units (or length of the verbal maze) in 
order to be able to increase the number 
of alternate choices per unit. If this 
were done to a sufficient extent, a point 
of inflection should be found beyond 
which there would be a nonlinear accel- 
eration of the function to the value of the 
independent variable at which learning 
to the criterion will not occur. 

The relations between the form of the 
serial position error curve and number of 
alternate choices per choice point cannot 
be given quantitative expression, even 
though the increase in extent of bowing is 
clearly evident. However, it should be 
noted that the characteristics of these 
curves are comparable to each other and 
to those for serial position error curves 
based on learning data for serial verbal 
material and spatial mazes. 


SUMMARY 


Five groups of Ss learned a 16-unit verbal 
maze for which the number of alternate choices 
per unit were 2, 3,4,6,or8. Rate of,acquisition 


measured in terms of mean total time and mean 
total errors to the criterion of one perfect repeti- 
tion increased with increase in the number of 
alternate choices per choice point. Analysis of 
variance indicates a significant effect of the inde- 
pendent variable on both measures of acquisition. 
Tests of deviation from linear regression reveal 
no deviation. Therefore, straight line regression 
equations were fitted to the data by the method 
of least squares. ‘These functions are y = 34.1x 
— 12.8 for errors and y = 115.4% + 157.4 for 
time in seconds, 

Theoretical discussion of these functions sug- 
gests that in the determination of maze difficulty 
the effect of number of alternate choices per 
choice point occurs early in acquisition ; the effect 
of number of like responses in the correct path- 
way occurs later in acquisition; and the num- 
ber of like responses is primarily responsible 
for the disproportionate decrease in the incre- 
ments of maze difficulty. 

Serial position error curves were constructed 
from the data for each group, and extent of bow- 
ing of these curves was found to increase with 
increase in the number of alternate choices per 
choice point. An analysis of variance shows this 
relation is significant statistically. Other char- 
acteristics of the serial position error curves are 
comparable to those obtained in studies of serial 
verbal learning and maze learning. 
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CLASSIFICATORY CATEGORIES! 
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University of California* 


Some psychologists are interested in 
the organism’s processing of informa- 
tion—how it is received or selected, 
organized, stored, made available 
when needed. Categories are ‘impor- 
tant in the processing of information 
because they establish equivalences in 
our Heraclitian world, making it pos- 
sible for the organism to profit by 
experience and react at a better-than- 
chance level with the environment (3, 
8, 12). 

It seems evident that the process of 
categorizing is itself one determinant 
of which previous experiences are 
brought to bear on subsequent occa- 
sions. That the use of a particular 
category influences the content of the 
information that is recalled has been 
demonstrated in many ways (1, 4). 
However, the influences of formal 
characteristics of categories, such as 
how many there are, their distinct- 
ness, exclusiveness, “ecological valid- 
ity,” generality, etc., have been studied 
very little. 

The problem of the present study is 
the apparently simple one of how mere 
number of categories affects the amount 
of information. that can be recalled. 
A further problem is how such effects 
can be explained in terms of more 
molecular psychological concepts con- 
cerning cognitive processes. 


An illustration may clarify the presentation 
of the issue. We have found it harder to remem- 
ber the names of our students when the class is 


1 This paper is based on a dissertation sub- 
mitted in partial fulfillment of the requirements 
for the Ph.D. degree at the University of Cali- 
fornia. The writer is indebted to Dr. Warner 
Brown, thesis director, and Dr. Leo Postman. 

? Now at Smith College. 
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either all male or all female than when the class 
is mixed—when there are two categories. What 
if there were classes with 3 sexes, or 4, or 20? 
Would it be easier and easier to remember names 
then? Of course, certain conditions must be 
specified. Let us assume that the categories are 
distinct, that S will have no problem in remem- 
bering what the categories are, that an equal 
number of cases will fall into each category, and 
that each case falls into its category unambigu- 
ously. Also though the above example suggests 
the method of stimulus-response pairs, let us 
assume the method of retained members. 


An experiment designed to test the 
effect of the number of categories on 
recall becomes essentially another 
study of the effect of similarity on 
recall. This is so because items do not 
belong to the same category unless 
they are in some respect similar. 
Therefore, with a total number of 
items that is fixed, increasing the num- 
ber of categories means decreasing the 
number of similar items. Suppose 
there is one group of Ss who place 24 
items into two categories and another 
group who place 24 items into six cate- 
gories. Later they are asked to recall 
the items that they classified. Which 
group will recall more? 

Although Gestalt and S-R memory 
theorists would analyze this situation 
somewhat differently, the prediction 
as to outcome would probably be the 
same. If items that belong to the 
same category are similar, one might 
say that items within a category form 
a “trace aggregate” from which indi- 
vidual items will be increasingly diffi- 
cult to extricate as the size of the 
aggregate increases. Or one might 
say that the category name is attached 
to each of the items within it (con- 
stant stimulus, different responses), 
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and as the number of items within a 
category increases, the tendency to 
recall any one of the items suffers 
greater interference from the tend- 
encies to give all the other items at the 
same time. 

Our data do not furnish any test of 
the first formulation. The second is 
derived from Gibson (5), although she 
did not attempt to apply these con- 
cepts to the recall situation set up by 
the method of retained members, since 
it is not subject to detailed stimulus- 
response analysis. Also, in the case 
of stimulus-response pairs, only one of 
the presumably competing responses 
is correct for each stimulus; in the case 
of retained members, all of the com- 
peting responses are “correct.” Never- 
theless, others have assumed that an 
analysis in terms of intraserial inter- 
ference is applicable to this situation 
(6, 9, 10). 

Although perhaps oversimplified, 
this analysis is the basis for the hy- 
pothesis that increasing the number of 
categories will increase the number of 
items that can be recalled. 


METHOD 


Because of difficulties in designing one set of 
material which was equally well classifiable into 
2, 3, or 6 categories, the alternative plan was 
adopted of constructing several lists which were 
equivalent in familiarity but composed of items 
only in part the same. The categorizing scheme 
to be used by S was imposed upon him not only 
by the instructions but also by the selection of 
items on the list he received. 

The general procedure was to give S a list of 
names of famous people, each of which he was to 
classify as belonging to a certain category. After 
a 10-min. interval of interpolated activity, he 
was instructed to write as many of the names as 
he could remember. 

Subjects.—The Ss were beginning psychology 
students attending weekly section meetings in 
groups of 10-25 at the University of California. 
The standardization of the categories was based 
on 150 Ss, and 238 Ss served in the experiment 
proper. 

The categories and construction of the lists.— 
The categories were: Artist, Athlete, Conqueror, 
Musician, Poet, and Scientist. Lists of 24 names 


belonging to 2, 3, or 6 of these categories were 
constructed. A 2-category list had 12 items in 
each category, a 3-category list had 8 items in 
each category, and a 6-category list 4 items in 
each category. 

The design of the experiment called for sets of 
names which were equally familiar, whether they 
were to be classified into 2, 3, or 6 categories. 
The procedure for obtaining equated lists was as 
follows. Subjects from the same population as 
that to be tested in the experiment proper were 
given 5 min. in which to think of as many mem- 
bers of a given category, e.g., Artists, as they 
could. At the end of each minute they were told 
to draw a line beneath the last name they had 
written. This technique was taken from Bous- 
field and Sedgewick (2). When the first 5-min. 
interval was finished, in most cases Ss were given 
a second category. No group was given more 
than two categories. Between 45 and 50 Ss 
contributed names for each of the six categories. 

The Ss were told what was to be meant by the 
category names. Conquerors were to include 
noncontemporary military or naval figures; 
Musicians were to be restricted to serious com- 
posers of music; Athletes should include great 
professionals only, etc. The restrictions were 
E’s intuitive attempts to establish categories of 
comparable size. 

After the names were collected, master lists 
which gave the percentage of Ss mentioning each 
recurring name were prepared for all the cate- 
gories. From these master lists, the experimental 
series were built, the attempt being to construct 
lists in which the names were of comparable 
familiarity. 

Equating lists.—The criteria that were set up 
for equating the lists were (a) that the percentage 
of mention of names in each list should average 
the same; (b) that the percentage of mention of 
names in each category within a list should aver- 
age the same; and (c) that in each list the num- 
ber of names mentioned by high, medium, and 
low percentages of Ss should be the same. The 
lists show some deviations from these criteria. 
Nevertheless, there is evidence that the equating 
procedure was successful with respect to selection 
of names within categories. 

Since the 12 names of the 2-category group 
for any one category included all the names 
belonging to that category which were on both 
the 3- and 6-category lists, one can look to see 
whether the four Musicians, for example, that 
were on both the 6-category list and the 2-cate- 
gory list accounted for about 33% of the total 
number of Musicians recalled by Ss in the 2-cate- 
gory group. If these names accounted for more 
than 33%, it might be that the 6-category group 
had been given the advantage of especially 
familiar names, in spite of the efforts to control 
for familiarity, and would suggest that the con- 
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TABLE 1 


PERCENTAGE oF ToTat REcALL By 2-Ca*tEGoRY 
Group ContrisuTep By NAMEs on 3- 
anv 6—Catecory Lists 














+-Coseny Cypser 
ist 
Category (Rapected : (Expected: 
67%) 33%) 
Artist 69 33 
Athlete 66 29 
Conqueror & 35 
Musician 68 33 
et 75 44 
Scientist 65 ; 29 
Mean 67 34 











trol procedure was inadequate. Similarly, the 
eight Musicians common to the 3-category list 
and the 2-category list should account for about 
67% of the total number of Musicians recalled 
by the 2-category group. The same should hold 
for the other categories. 

Table 1 shows that this is the case. Only for 
Poets was there a sizable (positive) variation 
from the expected percentages. (One might ex- 
pect to find that the 2-category group would 
show a disadvantage in its recall of Poets. In- 
stead, later evidence leads us to the interpreta- 
tion that facilitation effects among the larger 
number of Poets of the 2-category group were 
responsible for a particularly high level of recall 
by that group of some names on the list.) 

One further problem in the equating procedure 
remained. Although certain Musicians and cer- 
tain Poets, for example, may be mentioned 
equally often, one does not know whether Musi- 
cians or Poets are in general easier to recall. 
Therefore, 2- and 3-category lists were made 
using all of the six classes of names. For the 
2-category lists, 7 of the 15 possible combinations 
of six categories taken two at a time were used. 
For the 3-category lists, 3 combinations of six 
categories taken three at a time were used. 

Table 2 gives the combinations of categories 
that were used, the average percentage of men- 
tion of the names on each list, and the number of 
Ss who received each list. Table 3 gives for 
illustrative purposes the names that were on the 
6-category list. 

Procedure.—The Ss received booklets with 
instructions on the first page. The instructions 
given to one of the 3-category groups were as 
follows : 


* Persons interested in the original tabulations 
of names and the other derived lists may obtain 
a microfilm copy of the author’s dissertation from 
the Library of the University of California. 


“Your first task will be to identify 24 people 
as belonging to one of three categories: Poet, 
Athlete, or Artist. These people will be listed, 
with a blank preceding each name. If you recog- 
nize a name to be that of a Poet, put a ‘P’ in the 
blank. Put an ‘Ar’ if the person is an Artist 
and an ‘Ath’ if the person is an Athlete. Be sure 
that you use these symbols and no others. 

“If you do not recognize a name, or if you 
recognize it but have no idea what category it 
belongs in, leave the blank empty. Under such 
circumstances it will not be to your advantage 
to guess. 

“Later you will be asked to give as many of 
these names as you can. However, you are not 
to try to memorize the list. Spend only as much 
time on it as is necessary to identify accurately 
the names you know. As soon as you have 
finished, tear off the list and hold it up. The 
experimenter will take it, and you will proceed 
to answer some questions on Maladjustment.” 

The list of names was on the back page of the 
booklet. With respect to category, the names 
were given in jumbled order. After S gave the 
back page to E£, he turned to the second page of 
the booklet and answered questions on the mate- 
rial that had been assigned for discussion in the 
quiz section that day. Since the last S turned in 


TABLE 2 


Summary oF ComBINATIONS OF CATEGORIES 
Usep 1 2-, 3-, anp 6-Catecory Lists; 
PEeRceNnTAGES OF MENTION.OF NAMES ON 
Teese Lists; anp NuMBER OF 
Ss Recervinc Eacu List 























Average 
Categories % Mention N 
of Names 
2 Categories 
Conqueror, Musician 36 37 
Artist, Athlete 36 16 
Poet, Scientist 36 13 
Artist, Conqueror 34 12 
Athlete, Poet 37 11 
Artist, Poet 34 ll 
Musician, Scientist 35 10 
Mean 36 110 
3 Categories 
Conqueror, Musician, 

Scientist 36 38 
Musician, Poet, Scientist 36 15 
Artist, Athlete, Poet 36 12 
Mean 36 65 
6 Categories 
Artist, Athlete, Con- 

ueror, Musician, Poet, 
ientist 38 63 
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his page of names within 1 min. after the first S 
had turned in his, the intervening interval was 
approximately the same for all Ss. This inter- 
val was 10 min. after the last S had finished 
classification. 

To ensure that a minimum number of items 
was recognized as belonging in each category by 
each S, data were not used if S failed to classify, 
or erred in classifying, 6 or more of the 24 names 
on the list. Nineteen per cent of the 6-category 
group were excluded on this basis, as opposed to 
10% of the 3-category group and 9% of the 
2-category group. Inclusion of these data would 
increase the variability of the results and lower 
the average recall scores of all groups. How- 
ever, an examination of the data showed that 
the differences among groups would not be 
affected. 

Recall instructions were as follows: “On the 
blank sheet of your booklet write as many of the 
names on the list you classified as you can 
remember. There were 24 names in all, and 
there were equal numbers in each category. If 
you have forgotten any of the categories, you 
may refer to your instruction sheet.” Five 
minutes were allowed for recall. 


RESULTS 


Equated lists of names were given 
different amounts of “similarity” by 
varying the number of categories into 
which the fixed number of names was 
placed. Having assumed that items 
placed in the same category are “‘sim- 
ilar,” we asserted that the effect of 
increasing the number of categories 
should be to reduce the amount of 
competition among the items. Ac- 
cording to hypothesis, then, the 6- 
category group should recall more 
names than the 3-category group, and 
the 3-category group should recall 
more names than the 2-category group. 
Furthermore, in most cases the per- 
centage of 6-category Ss recalling a 
specific name should be greater than 
the percentage of 3- and 2-category Ss 
recalling the same name. Finally, 
these results should be obtained not 
only in the gross averages based on the 
performances of all 6-, 3-, and 2-cate- 
gory Ss, but also in the averages based 
on their performances with respect to 


TABLE 3 


NaMEs*aND THEIR PERCENTAGES OF MENTION 
FOR THE 6-CaTecory List 


























Category | mention| Catesory | sertion 
Artists Musicians 
Michelangelo 60 | Bach 74 
Picasso 58 Debussy 38 
El Greco 18 Mendelssohn} 38 
Goya 12 Verdi 20 
Mean 37 Mean 43 
Athletes Poets 
Ruth 75 Poe 69 
Dempsey 31 Keats 32 
Hogan 20 | Chaucer 26 
Kramer 13 Sandburg 17 
Mean 35 Mean 36 
Conquerors Scientists 
Napoleon 88 Newton 51 
Genghis Khan 24 Curie 47 

. P. Jones 33 Galileo 29 

livar 10 Darwin 16 

Mean 39 | Mean 36 














Grand Mean = 38 





Artists alone and with respect to each 
of the other category types. 

Table 4 presents the gross averages 
and significance of differences. The 
difference between the 6- and 2-cate- 
gory groups is significant beyond the 
.001 level of confidence and the differ- 
ences between the 6- and 3-category 
groups and between the 3- and 2-cate- 
gory groups are significant by the one- 
tailed test beyond the .01 level of 
confidence. Thus the over-all results 
are according to hypothesis. 

The second part of the demonstra- 


TABLE 4 


Mean Recatu or Names By 2-, 3-, 
anp 6-CaTecory Groups 











Group N Mean t Dp 
2-category 110 | 11.75 | 2.61 (2-3) | .O1 
3-category 65 | 13.05 | 2.62 (3-6) | 01 
6-category 63 | 14.42 | 5.68 (2-6) | .001 
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tion of over-all differences is given by 
comparison of the recall of the same 
names by 2-, 3-, and 6-category Ss. 
Of the 24 names common to the 6- and 
the various 2-category lists, 79% were 
recalled by a higher proportion of 
6-category Ss. Of the 19 names com- 
mon to the 6- and 3-category groups, 
79% again were recalled by a higher 
proportion of the former. Of the 48 
names common to the 3- and 2-cate- 
gory groups, 63% were recalled by a 
higher proportion of the 3-category 
group. In general, the differences 
were greater for the less prominent 
names (Bolivar) than for the very 
prominent names (Napoleon). 

Another over-all result in accord 
with the hypothesis is furnished by the 
intrusion data. An intrusion was said 
to occur when S, in attempting to re- 
call the names that he had previously 
classified, included a name which had 
not actually been a member of his list. 
For example, a 6-category S who had 
classified the musicians Bach, Mendels- 
sohn, Debussy, and Verdi might dur- 
ing the recall period write the names 
of Beethoven, Mendelssohn, and De- 
bussy. Beethoven would be counted 
as an intrusion. Our hypothesis was 
that intrusions would increase as the 
number of categories decreased, be- 
cause the greater interference among 
the 12 items of the 2-category group, 
for example, would allow more opportu- 
nity for strong items not on the list to 
intrude themselves than in the case of 
the 4 items of the 6-category group. 

The results were that the 2-cate- 
gory group made an average of .75 
intrusions, the 3-category group an 
average of .51 intrusions, and the 
6-category group an average of .44 
intrusions. 

The third part of the analysis of 
differences in recall is presented in 
Table 5, which shows the means and 
significance of differences by specific 
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TABLE 5 


Recaut By Speciric Carecory ror 2-, 3-, AND 
6-Catecory Groups in AVERAGE 
TRANSFORMED PERCENTAGES 


























No. of 
Cc Cate- | M Com- 
ategory . ~~ ean | df* | tt | parison 

Artist 2 45.39, 49) — | 2-3 
3 43.70} 73} 1.71) 3-6 

6 53.57| 100 | 2.38} 2-6 

Athlete 2 43.32} 37| — | 2-3 
3 46.21) 73] 1.97) 3-6 

6 54.76, 88 | 3.02) 2-6 

Conqueror 2 | 43.69) 85 | 2.43) 2-3 
3 50.57; 99} 1.33) 3-6 

6 54.86] 110 | 4.54) 2-6 

Musician 2 42.04) 98 | 3.15) 2-3 
3 49.94 114] — | 3-6 

6 50.00} 108 | 2.76) 2-6 

Poet 2 47.144 6€| — | 2-3 
3 46.28} 88| — | 3-6 

6 48.81) 96| — | 2-6 

Scientist 2 44.82) 74) — | 2-3 
3 44.63) 114 2.61] 3-6 

6 51.79, 84] 1.90) 2-6 




















* The df figure is two less than the number of Ss of 
the two groups being compared who categorized names 
of the specific category under consideration. In the 
first instance, 16 2-category Ss categorized Artists and 
Athletes, 12 categorized Artists and Conquerors, and 
11 Artists and Poets, a total of 39. There were 12 
cacy Ss who categorized Artists. (See Table 2.) 
Thus df equal 49. 

+ By the one-tailed ¢ test for 60 or more df, a t of 1.671 
is significant at the 5% level of confidence, and a ¢ of 
2.390 is significant at the 1% level of confidence. 


category. Since the number of items 
in any one category was different for 
the 2-, 3-, and 6-category groups, recall 
percentages were obtained from the 
raw scores. As the estimate of vari- 
ance which we wished to use in the ¢ 
test assumes the same variance in both 
universes sampled, the arc sin trans- 
formation was used, whereby percent- 
ages are transformed into correspond- 
ing angles (11). 

Table 5 shows that of 18 possib! 
comparisons, 15 are to the advantage 
of the group with the larger number 
of categories. There are 10 differ- 
ences significant at not less than the 
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5% level of confidence, and all of these 
are in the predicted direction. 


However, there remain the compari- 
sons, particularly several between the 2- 
and 3-category groups, for which the 
advantage of a greater number of cate- 
gories was not demonstrated. These 
results may be attributed simply to 
“chance,” though several lines of evi- 
dence suggest that they are due to uncon- 
trolled differences in the amount of inter- 
ference which Ss encountered from the 
store of Poets, Artists, etc. which were 
not on the list. 

For example, a correlation of .714 
between relative size of “parent cate- 
gory” (estimated from the average num- 
ber of names which the preliminary group 
of Ss could think of in 5 min.) and the 
difference between the recall of the 2- and 
3-category groups shows that the 2-cate- 
gory group was inferior when the parent 
category was large but overtook the 
3-category group when the parent cate- 
gory was small. 

Our explanation of this result is that 
in the case of small parent categories the 
2-category group was released from most 
of the proactive inhibition which bur- 
dened the recall task of the 3- and 6-cate- 
gory groups. That is, the 12 Artists on 
the 2-category list comprised most of the 
Artists known to the Ss, so that they did 
not have as much problem in discrimi- 
nating correct items from incorrect items 
as did the 3- and 6-category Ss with 8 
and 4 Artists, respectively, on their lists. 
This explanation assumes amount of 
interference to be a negatively accelerated 
function of number of competing items, 
an assumption that is consistent with the 
fact that most of the preceding results 
show more difference between the 6- and 
3-category groups than between the 3- 
and 2-category groups. (Other findings 
about the relation of number of com- 
peting items to amount of interference 
are conflicting and not strictly relevant 
to this experiment.) 

Reduction of proactive inhibition 
would also go a long way toward account- 
ing for the results of an experimental 
variation in which a 1-category group of 


13 Ss was found to recall, not fewer Con- 
querors or Poets than the 2-category 
group, but just as many as the 6-cate- 
gory group. The 24 items on this list 
approximated being all the items of the 
category that S knew, so that interfer- 
ence from incorrect items was greatly 
reduced. An analysis of the role of pro- 
active inhibition in these experiments is 
presented more fully elsewhere (7). 

Another experimental variation was 
undertaken to show that “latent cate- 
gorizing,” which we believe occurs almost 
continually in everyday life, yields the 
same results as the explicit categorizing 
that was required of Ss in the main ex- 
periment. Two groups were given the 
same lists that the 2- and 6-category 
groups received but were asked only to 
check whether or not they were familiar 
with each name. The categories were 
not mentioned in the instructions. 

The results were quite similar to those 
obtained with explicit categorizing. 
Eleven 2-category Ss recalled an average 
of 10.82 names and 28 6-category Ss 
recalled an average of 14.12 names. This 
difference is significant at the .001 level 
of confidence. The 2-category list con- 
sisted of Conquerors and Musicians, for 
which previous results had yielded an 
average of 11.38 for 37 Ss. The 6-cate- 
gory list, as used in the main experiment, 
yielded a mean of 14.42 for 63 Ss. 


Discussion 


The results of this experiment have 
limited generality. The number of cate- 
gories was varied within a small range. 
Items were distributed equally within 
categories. The categories were distinct 
and there was no problem for S in remem- 
bering the categories. Preliminary ex- 
perimentation suggests that when these 
last two conditions do not apply, increas- 
ing the number of categories may even 
decrease the amount of information that 
can be recalled. Assuming the conditions 
of the experiment described here, how- 
ever, we conclude that increasing the 
number of categories will generally in- 
crease the amount that can be recalled. 

Understanding why this effect should 
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“generally” occur draws upon, and serves 
as a test of, our knowledge of recall proc- 
esses. Is what has been done here essen- 
tially a demonstration of the von Restorff 
phenomenon (13)? If the von Restorff 
phenomenon is to be considered as involv- 
ing figure-ground effects in memory, then 
this experiment does not seem to fit. It 
would be difficult to say which was figure 
and which ground in our “zebra’-like 
pattern, since the “stripes” were kept 
equal in size and varied together. 

We do interpret this experiment as 
having shown that different degrees of 
interference can be created by varying 
the number of items on a list that are 
similar in only those abstract attributes 
that make them members of the same 
category. However, that this interfer- 
ence comes only from competing tend- 
encies to respond among the experimental 
items, as the original “crude” analysis 
stated, was not proven. Since a sample 
of a population of familiar names served 
as materials, some of the interference 
certainly came from proactive inhibition. 
That is, the larger the number of items 
of the same category on the experimental 
list, the more did “recency” discrim- 
inanda, and other discriminanda ena- 
bling the S to distinguish experimental 
items from other “parent-category” 
items, generalize to items not on the list, 
so that the decrement in recall and the 
large number of intrusions resulted in 
part, at least, from greater difficulty in 
discriminating correct items. 


SUMMARY 


The problem investigated was how the num- 
ber of categories into which a body of items is 
classified affects the recall of the items. 

The Ss classified names of famous people into 
2,3, or 6 categories. The different lists of names 
used were equated on the basis of controlled 
association data. On recall tests 10 min. after 
the classification of the names, Ss who had been 
given 6 categories recalled significantly more 
names on the average than those who had been 
given 3 categories, who in turn recalled signifi- 
cantly more names on the average than those 
who had used 2 categories. Intrusions varied 
inversely with number of categories. 


The results are interpreted in terms of inter- 
ference among the items of the experimental list 
and between those on the list and other familiar 
items in the S’s “parent-categories.” 


REFERENCES 


1. Bartiett, F.C. Remembering: a study in 
experimental and social psychology. New 
York: Macmillan, 1932. 

2. Bousrretp, W. A., & Sepcewickx, C. H. W. 
An analysis of sequences of restricted 
associative responses. J. gen. Psychol., 
1944, 30, 149-165. 

3. Bruner, J. S.,& Austin,G. A. A theory of 
cognition. (Mimeographed.) 

4, CarmicnHac., L., Hocan, H. P., & WALTER, 
A.A. An experimental study of the effect 
of language on the reproduction of visu- 
ally perceived form. J. exp. Psychol., 
1932, 15, 73-86. 

5. Grsson, E. J. A systematic application of 
the concepts of generalization and differ- 
entiation to verbal learning. Psychol. 
Rev., 1940, 47, 196-229. 

6. McGeocn, J. A., & Inton, A. L. The psy- 
chology of human learning. New York: 
Longmans, Green, 1952. 

7. Matuews,R. Recall of items as a function 
of the number of categories into which 
they have been classified. Unpublished 
doctor’s dissertation, Univ. of California, 
1952. 

8. Postman, L. J. ‘Toward a general theory of 
cognition. In J. H. Rohrer & M. Sherif 
(Eds.), Social psychology at the crossroads. 
New York: Harper, 1951. Pp. 242-272. 

9. Saut, E. V., & Oscoop, C. E. Perceptual 
organization of materials as a factor influ- 
encing ease of learning and degree of 
retention. J. exp. Psychol., 1950, 40, 
372-379. 

10. Srecet, P. S. Structure effects within a 
memory series. J. exp. Psychol., 1943, 
33, 311-316. 

11. Snepecor, G. W. Statistical methods. Ames, 
Ia.: Iowa State Coll. Press, 1946. 

12. Toman, E. C., & Brunswix, E. The organ- 
ism and the causal texture of the environ- 
ment. Psychol. Rev., 1935, 42, 43-77. 

13. Von Restorrr, H. Ueber die Wirkung von 
Bereichsbildung im Spurenfeld. Psychol. 
Forsch., 1933, 18, 299-342. 


(Received July 6, 1953) 





Journal of Experimental Psychology 
Vol. 47, No. 4, 1954 


PERFORMANCE IN INSTRUMENTAL LEARNING AS A 
JOINT FUNCTION OF DELAY OF REINFORCEMENT 
AND TIME OF DEPRIVATION! 
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State University of Iowa 


Initial investigations of behavior 
often attempt to isolate the effects of 
single experimental variables. After 
two or more such influential factors 
have been identified, it becomes of 
interest to determine their joint effect, 
i.e., the manner in which such factors 
interact or combine to produce re- 
sponse strength. 

Two variables which have been 
observed to affect response speed in 
simple instrumental learning situa- 
tions are delay of reinforcement (7) 
(7, 8) and time of food deprivation 
(Tp) (1, 5, 6, 9). Hull (2) has 
assumed that Tg and Tp interact 
multiplicatively, whereas Spence (10) 
recently suggested, on the basis of 
certain initial evidence (6), that the 
relationship between Tg and Tp is 
additive. 

The present study is a further 
attempt to determine the manner in 
which 7g and Tp interact. Specifi- 
cally, it has investigated a particular 
response measure, speed of running, as 
a function of two periods of delay of 
reinforcement (1 and 5 sec.) and two 
periods of food deprivation or levels 
of drive (22 and 4 hr.). 


MeEtTHOD 


Subjects.—The Ss were 40 male hooded rats 
from the colony of the lowa Psychological Lab- 
oratory. All were experimentally naive and 
between 95 and 134 days old at the start of the 


1 The writer is indebted to Professor Kenneth 
W. Spence, who suggested the problem and 
supervised its investigation. 

2 Ist Lt., U.S. Army. Now Acting Military 
Chief, Human Research Unit No. 3, Office, Chief 
of Army Field Forces, Fort Benning, Georgia. 


experiment. Half of the animals were always 
run 22 hr. after feeding and the other half 4 hr. 
after feeding. Similarly, half of each drive group 
received reinforcement 1 sec. after responding, 
while the other half were reinforced after 5 sec. 
Thus, the four experimental groups may be iden- 
tified in terms of their respective degrees of each 
variable as follows: 22 hr.—1 sec., 22 hr.—5 sec., 
4 hr.—l sec., and 4 hr.—5 sec. 

In addition, Ss were assigned to ten levels 
according to their ages and litters, such that one 
S from each of the ten levels was in each experi- 
mental group. Other potential sources of vari- 
ance, such as living cage, feeding cage, and order 
of running, were confounded with this levels 
factor. 

Apparatus.—The apparatus used was a modi- 
fication of a particular type of Skinner box which 
has been described elsewhere (6,7). It consisted 
of a small, triangular-shaped response chamber 
with flat black, plywood walls and a brass floor. 
A starting alley led into the front apex through 
an outer metal door and an inner door of glass. 
A brass bar could be presented } in. through a 
hole in the opposite wall. Raising the glass door 
started a Standard Electric Timer. By touching 
the bar, S completed a low-ampere contact relay 
circuit (3) between the brass floor of the box and 
the bar. Closure of this circuit stopped the 
timer, retracted the bar, and fired a Hunter 
Decade Interval Timer (4). One or 5 sec. later, 
a .08-gm. pellet of food dropped into a food cup 
situated just below where the bar had been. A 
1000-cycle tone sounded concurrently with the 
release of the pellet and was turned off manually 
after S obtained the food. The apparatus was 
situated in a dark, sound-resistant room. A 
shielded 7-w. bulb was located above the food 
cup; the total illumination level in the response 
chamber was such that the food cup could not 
be seen by E (nor, presumably, by S) unless the 
7-w. light above it was on. 

Procedure.—All Ss were placed on a 22-hr. 
feeding schedule for seven days prior to prelimi- 
nary training, during which time they were fed 8 
gm. of dry Purina Laboratory Chow in individual 
feeding cages at the same hour every day. The 

22-hr. and 4-hr. groups were later fed at different 
times in order that they could be run at approxi- 
mately the same hour of the day. 
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Fic. 1. Running speed as a function of time 
of deprivation (Tp) and delay of reinforcement 
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Beginning on Day 8, each S received 1 hr. 
of preliminary training daily for three days in 
order to eliminate irrelevant exploratory beha- 
vior during learning, and to establish the second- 
arily reinforcing properties of the 1000-cycle 
tone. The bar was never present during this 
preliminary training. Each S was placed in the 
response chamber with the bar light on and a 
pellet of mash in the food cup. Thirty seconds 
after S obtained this food and was facing the 
food cup, another pellet was delivered, accom- 
panied by the sound of the 1000-cycle tone. The 
tone was turned off after S appeared to have 
eaten the pellet. In this manner 4 more 
pellets were given at approximately 30-sec. inter- 
vals, whereupon S was removed from the appa- 
ratus until the following day. Preliminary train- 
ing thus consisted of the receipt of 15 pellets of 
food, 5 per day. 

Training proper began the day after prelimi- 
nary training, and continued ry day there- 
after for 16 days. On each trial, S was placed 
in the starting alley and the metal door raised 
as soon as S faced the response chamber; 3 sec. 
later the glass door was raised, permitting en- 
trance into the response chamber. A total of 
104 trials was administered at the rate of four 
per day for the first six days, and eight per day 
thereafter. The average intertrial interval was 
5 min. Latencies were recorded to the nearest 
01 sec. on all trials. 


REsuLTs AND Discussion 


Performance curves for the four 
groups are shown in Fig. 1. Each 
point represents the mean of the 
speeds (reciprocal latencies) of ten Ss 
over eight trials. A mean “asymp- 

















TABLE 1 
Group Mean Response Speeps, 
Last 32 Triats 
Delay of 
Time of Reinforcement alae 
Deprivation Means 
1 sec. 5 sec. 
22 hr. 885 .709 .797 
4hr. .772 559 666 
Column means 829 634 














totic” response speed was calculated 


for each S by averaging his speeds on 


the last 32 trials. The group means 
of these values are shown in Table 1, 
and a summary table of an analysis of 
variance of this 2 X 2 factorial design 
is shown in Table 2. 

The short-delay groups ran faster 
(p<.02, single-tailed test) than did 
the long-delay groups, and the high- 
drive groups ran faster (p <.08, single- 
tailed test) than did the low-drive 
groups. More importantly, no signifi- 
cant interaction was found between 
the effects of drive and delay of rein- 
forcement. This latter result may be 
interpreted as questioning Hull’s as- 
sumption that 7g¢ and Tp interact 
multiplicatively, and as supporting 
Spence’s proposal of an additive rela- 
tionship between 7g and Tp. It 
should be borne in mind, however, 
that this finding of no interaction 
between 7'g and Tp may not be gen- 
eralized with any confidence beyond 
the particular values of each variable 
that were employed, i.e., 22-hr. and 


TABLE 2 


Anatysis oF VARIANCE oF Mean Response 
Speeps, Last 32 Triats 








Source af — F oe 


Deprivation (7'p) 1 |174,240 3.30) .08 
Delay (T@) 1 |378,691 7.17} .02 
Drive X delay 1 3,423 | <1 

Within cells 36 | 52,844 
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4-hr. food deprivation, and l-sec. and 
5-sec. delay of reinforcement. That 
the additive relationship probably 
does not hold throughout the entire 
ranges of these factors is clearly indi- 
cated by consideration of the joint 
effects of extreme values of the vari- 
ables. For example, performance does 
not improve under conditions of infi- 
nite delay of reinforcement no matter 
how great the time of deprivation em- 
ployed. Thus, it may be tentatively 
concluded that the manner in which 
Tg and Tp combine is more complex 
than either simple addition or multi- 
plication, but that it may resemble 
each of these forms of combination, 
depending upon the particular values 
of Tg and Tp. 

A three-dimensional analysis of 
these same data indicated that the 
variance due to the above-described 
levels factor (a confounding of age, 
litter, order of running, etc.) was too 
small to reduce the error term appre- 
ciably; hence the data were analyzed 
simply as above. This similar per- 
formance of the different levels may 
have been the result of a cancellation 
of opposite effects of the different com- 
ponents which together comprised the 
levels factor, or it may simply reflect 
the lack of influence of each compon- 
ent taken singly. In any case, the 
results suggest the inefficiency of con- 
stituting animal Ss into the type of 
levels used here. 


SUMMARY 


Forty male hooded rats were administered 
104 reinforced trials in a simple instrumental 
learning situation. Four groups each received 


one of two levels of the following experimental 
variables: delay of reinforcement (1 sec. vs. 5 
sec.) and time of food deprivation (22 hr. vs. 4 
hr.). 

The short-delay groups ran faster than the 
long-delay groups, and the high-drive groups ran 
faster than the low-drive groups. No significant 
interaction was found between the effects of 
delay of reinforcement and time of deprivation, 
a result which is contrary to Hull’s assumption 
that Tg and Tp combine multiplicatively, but 
consistent with Spence’s suggestion that these 
factors combine additively. 
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OVERTRAINING AND TRANSPOSITION IN CHILDREN 


HAROLD W. STEVENSON AND IRA ISCOE 
The University of Texas 


The problem investigated here has 
been discussed by Spence (9) and has 
received the attention of Jackson and 
collaborators in several studies (4, 5, 
6). On the basis of their studies with 
young children trained in the usual 
two-stimulus transposition situation, 
Jackson et al. (6) propose that with 
increasing amounts of overtraining 
unstable responses appear, followed by 
stable relative responses, and finally 
by stable absolute responses in which 
Ss choose the training stimulus rather 
than the new stimulus. The increase 
in frequency of relative responses after 
overtraining has also been reported by 
Hunter (3), and the effects of over- 
training have been offered by Jones 
and Dunn (7) as a possible explana- 
tion of the high relationship that they 
found between speed of learning and 
frequency of relative choice. The 
present study attempts to determine 
the effect of overtraining on transposi- 
tion in children who are older than 
those used in the above studies and 
who have been trained in a three- 
stimulus, rather than a two-stimulus, 
transposition situation. 


MeETHOD 


Subjects.—The Ss were 140 students in grades 
four through seven in two parochial schools in 
Austin, Texas, and ranged in age from 9.2 to 
14.2 yr 

Materials.—The apparatus consisted of a gray 
board containing three holes 1} in. apart into 
which gray boxes of the same shade could be 
fitted. Five boxes were used: three for the 
training trials and two additional ones for the 


1 We wish to acknowledge the kind coopera- 
tion of Sr. M. Bernarda of St. Mary’s School and 
Sr. M. Eugenia of St. Austin’s School in this 
study. 
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transposition trials. The dimensions of the 
boxes were 2} X 2 X lin. In the center of the 
lid of each box was mounted a yellow square, the 
stimulus to be discriminated. The sides of the 
stimulus squares on Boxes 1 through 5 were }, 
4, 4, 1, and 1} in. 

Procedure.—On the training trials Boxes 2, 3, 
and 4 were used for all Ss. The position of the 
boxes on the board was varied randomly from 
trial to trial, with the same random sequence 
being used for each S. The S was to find the 
box in which an eraser was hidden. For half of 
the Ss it was always to be found in Box 4, and 
for half in Box 2. These may be designated the 
Large and Small groups. ~ 

Two degrees of correct response were required : 
half of the Ss were required to learn to the cri- 
terion of four consecutive correct responses, while 
the other half had to make ten consecutive cor- 
rect responses. We may designate these the 4 
trial and 10-trial groups. As a result of this 
procedure four groups of 35 Ss each were formed: 
Large-4; Large-10; Small-4; and Small-10. 

When each S entered the testing room he was 
seated across the desk from E. At S’s right was 
a cardboard screen and in front of him was the 
board with the boxes. The S was told: “This is 
a game we’ve been playing with boxes. See, each 
box opens like this. Now see this [eraser]? I’m 
going to hide this in one of the boxes and I want 
to see if you can find it:’”” The E removed the 
board from S’s sight behind the screen and placed 
the eraser in one of the boxes. The board was 
then placed in front of S and he was told, “open 
the box that you think it is in.” After each 
choice the board was placed behind the screen 
and the boxes rearranged. The trials were con- 
tinued for S until he met the criterion to which 
he had been randomly assigned. 

The test trials followed immediately with no 
additional instructions or remarks. For the 
Large groups, Box 2 was removed and Box 5 
introduced; and for the Small groups, Box 4 was 
removed and Box | introduced. The test trials 
continued until four consecutive correct responses 
were made. The eraser was found in the largest 
or smallest box, depending upon whether S was 
in the Large or Small groups. When S met this 
criterion, a second set of transposition tests was 
given. For the Large groups Boxes 4 and 5 were 
removed and Boxes 1 and 2 introduced; for the 
Small groups Boxes 1 and 2 were removed and 
Boxes 4 and 5 introduced. Again, the eraser was 
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TABLE 1 
TRIALS REQUIRED to ATTAIN TRAINING CRITERION 
Age Trials to 4 Correct Trials | Trials to 10 Correct Trials 
Group N 

Mean SD Mean SD Mean SD 
Large-4 35 11.2 9 16.7 16.2 
Large-10 35 11.4 39 21.1 23.7 28.9 27.8 
Small-4 35 11.3 1.4 23.6 24.7 
Small-10 35 11.2 1.0 17.5 18.6 26.5 29.3 


























found in the largest or smallest boxes in this new 
sequence. 

At the conclusion of the experiment Ss were 
asked, “How did you know it was in that one?” 
and were urged to verbalize their basis of 
response. 


RESULTS 


Training trials —The mean number 
of trials required by all Ss to reach the 
4-trial criterion and for the overtrained 
Ss to reach the 10-trial criterion may 
be seen in Table 1. (Here, and 
throughout the discussion, the number 
of trials is exclusive of the criterion 
trials.) ‘The mean age for each group 
is also shown, indicating that as far as 
age is concerned, the four groups were 
quite homogeneous. 

The distributions of learning trials 
were all markedly skewed. Some Ss 
in each group solved the problem im- 
mediately by choosing the correct box 
on the first choice and continuing to 
choose it correctly to criterion, while 
others took up to 111 trials to reach 
the criterion. The great differences 
in ease of acquisition of a 4-trial cri- 
terion for all Ss are not, at least within 
the rather restricted age range used, 
related significantly to chronological 
age (r =—.11). Mental age ratings 
were not available. 

In order to determine the compar 
bility of the 4 and 10-trial groups ii 
their ability to acquire the size dis- 
crimination, we may combine the Ss 
according to learning criterion (Large- 
10 with Small-10 and Large-4 with 
Small-4) and test whether the mean 


number of trials required to attain a 
criterion of four consecutive correct 
responses differed significantly. The 
combined mean for the two 10-trial 
groups was 19.3, and 20.6 for the 4-trial 
groups, with corresponding standard 
deviations of 21.4 and 20.7. Applying 
Festinger’s technique (2) for evalu- 
ating the differences between means 
derived from skewed distributions, an 
F of 1.07 results (df = 113, 139), which 
is not significant (p>.05). We may 
assume, therefore, that the’ groups 
were comparable in their ability to 
learn such a discrimination. 

As may be seen in Table 1, the effect 
of the more stringent (10-trial) crite- 
rion was to increase the number of 
training trials for Ss in these groups, 
i.e., to result in overtraining for these 
Ss. Comparing the mean number of 
trials required by the 10-trial and the 
4-trial groups to reach their respective 
criteria (again excluding the criterial 
trials) an F of 1.34 results (df = 132, 
140), which is significant at between 
the .05 and .01 levels. 

The mean number of times that 
each box was chosen up to the criterion 
trials may be seen in Table 2. It is 
clear that some overtraining on the 
correct choice had occurred before the 
ten criterion trials commenced in the 
10-1 groups, for the difference 

2tween the means of the 4- and 10- 
trial groups is notably greater for the 
correct stimuli than it is for any of the 
other stimuli. Excluding the criterion 
trials, the correct stimulus was chosen 





|| 


| mare | 








TRANSPOSITION IN CHILDREN 253 


on the average 4.2 (11.8 vs. 7.6) more 
times in the 10-trial groups than in the 
4-trial groups. A comparison of the 
difference between these means is 
significant at beyond the .01 level 
(F = 1.55; df = 195, 88). 

Test trials.—If transposition is tested 
by determining the number of Ss who 
responded relatively on the first test 
trial, it is found that 83 of the 140 Ss, 
or 60%, showed transposition. Of 
the remainder, 38 Ss chose the stim- 
ulus on which they had been ‘rained, 
and 19 chose the third stimulus. The 
breakdown of these data is found in 
Table 2. The percentage of Ss show- 
ing transposition is notably lower than 
that usually found in two-stimulus 
transposition for younger Ss similarly 
tested with stimuli one step removed 
from the training stimulus. Unfor- 
tunately, additional data for Ss of this 
age level or for three-stimulus trans- 
position are not available by which 
this relatively low frequency might be 
explained. 

Of the 83 Ss who showed transposi- 
tion, 38 were in the 4-trial and 45 were 
in the 10-trial groups. This difference 
is not significant (x? = 2.4; .10<p< 
.20), and we may conclude that the 
degree of overtraining employed here 
did not significantly increase the fre- 
quency of transposition on the first 
test trial. Further, there was no dif- 
ference between the Large and Small 
groups in their frequency of relative 


choice (x? = .92). In nocase through- 
out the remainder of the experiment 
was a significant difference found be- 
tween these groups, and henceforth 
the Large and Small groups will be 
combined. 

If we apply a more sensitive test of 
transposition, that of consistency of 
response on the test trials, the pic- 
ture changes radically. Since, in the 
method used in this experiment all Ss 
were given at least four test trials, we 
may determine the number of Ss who 
made a relative choice consistently on 
the first four test trials. The number 
of Ss who may be judged to show 
transposition according to this crite- 
rion is 72, and of these only 28 were in 
the 4-trial groups, while 44 were in the 
10-trial groups. The chi square that 
results here is 7.31, which with 1 df is 
significant at beyond the .01 level. 
Further evidence of this difference is 
provided by the fact that the mean 
number of trials required (up to, but 
not including the four criterial test 
trials) by the 10-trial groups was 1.27 
and by the 4trial groups was 3.87. 
Again applying the Festinger test, this 
difference is significant at beyond the 
001 level (F = 3.04; df = 32, 157). 

On the first trial of the second set of 
transposition trials, 116 of the 140 Ss 
made a relational response, an increase 
in frequency of 23.4% over the first 
trial of the first set of transposition 
trials. In this trial the 10-trial groups 














TABLE 2 
Tue Ss’ Cuoitces on Tratninc anp Test TRIALS 
Training Trials Stimuli Test-1 Stimuli Test-2 Stimuli 
Means) (Frequencies) (Frequencies) 
Group 
Small Medium Large Small Medium Large Small Medium Large 
Large-4 5.5 5.5 5.7° 6 9 20* 6 3 26* 
Large-10 7.5 8.9 12.5* 7 25* 5 0 30* 
Small-4 9.5* 7.5 6.6 17* 12 6 26* 5 4 
Small-10 11.1* 7.9 7.5 21* 10 4 34* 0 1 
































* Correct stimulus. 
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made a significantly greater number 
of relational choices than did the 4- 
trial groups. The chi square for fre- 
quency of relational versus nonrela- 
tional responses in the two groups is 
7.24, which is significant at beyond the 
.O1 level (df = 1). Utilizing consis- 
tency of relational responses as the 
measure of transposition, we find that 
105 Ss met this criterion, with 46 in 
the 4-trial groups and 59 in the 10-trial 
groups. In comparing these frequen- 
cies a significant chi square is also 
obtained (6.44; .01<p<.02). The 
choices made on the first trial of the 
second set of transposition trials are 
shown in Table 2. 

In summary, the frequency with 
which Ss in the 4- and 10-trial groups 
chose the relative stimulus did not 
differ significantly on the first trial of 
the first set of test trials, but was 
significantly greater in the 10-trial 
groups on the first trial of the second 
set of test trials. On both sets of test 
trials the 10-trial groups showed a 
significantly greater frequency of con- 
sistent relative choices than did the 
4-trial groups. 

V erbalization—By the end of the 
experiment each S had been forced to 
make the same relative choice on at 
least four training trials and on two 
sets of test trials, yet only 111 of the 
140 Ss were able to verbalize their 
basis of response. Those who did ver- 
balize gave answers varying from 
terse explanations that it was always 
the biggest (or smallest) square to 
detailed descriptions of the procedure 
for the entire experiment. Those who 
did not verbalize appropriately said 
they did not know or made such un- 
definitive statements as “It’s in that 
box.” The nonverbalizers proved to 
be somewhat younger than the ver- 
balizers, being on the average 9 
mo. younger. Seventeen of the 
nonverbalizers were in the 4-trial 
groups and 12 in the 10-trial groups. 


TABLE 3 


Cuorces Mave sy VERBAL AND NonvERBAL Ss 
ON First TRANSPOSITION TRIALS 








Class of Subjects 
Type of Response 











Verbal Nonverbal 
Relational 72 11 
Nonrelational 39 18 
Consistent relational 66 6 
Nonconsistent relational 45 23 











The nonverbalizers did significantly 
poorer in transposition than did the 
verbalizers. Testing the frequency 
with which the verbalizers and non- 
verbalizers made relational responses 
on the first test trial of the first trans- 
position series as seen in Table 3, a 
chi square of 6.91 results, which with 
1 df is significant at beyond the .01 
level. Testing the difference in num- 
ber of Ss making consistent correct 
responses on the first test series, a chi 
square of 14.09 is obtained, which is 
significant at well beyond the .001 
level. The ability to verbalize, then, 
was positively correlated with the 
tendency to make relational choices on 
the test trials. 


Discussion 


The results of this experiment only 
partly support the generalizations con- 
cerning overtraining that were made by 
Jackson et a/. (6), for although overtrain- 
ing affected the consistency with which 
relative responses were made, it did not 
have a significant effect in increasing the 
initial frequency of relative responses 
until a second set of test trials was given. 

In interpreting this experiment, then, 
it is proposed that the responses were a 
result of a conceptual process rather than 
of a strong tendency towards perceiving 
situations relationally or of an interac- 
tion of habit strengths. This interpre- 
tation is supported by the high relation- 
ship that was found between ability to 
verbalize at the end of the experiment 
and the incidence of transposition. Sim- 
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ilarly, both Kuenne (8) and Alberts and 
Ehrenfreund (1) found that those Ss who 
were able to abstract from their experi- 
ence to the degree that they could ver- 
balize appropriately concerning the basis 
of response showed the greatest frequency 
of transposition. 

We may hypothesize that two con- 
cepts had to be developed, a concept of 
size and of frequency. That is, first the 
Ss had to develop some concept about the 
particular type of stimulus that would be 
correct, and second a concept about the 
frequency with which this stimulus would 
be correct on any series of trials. The 
primary effect of overtraining apparently 
was that of establishing a clearer concept 
of consistency than of relative size. Once 
the overtrained Ss found a stimulus to 
be correct, irrespective of the basis of 
their choice, they continued to choose the 
correct stimulus to a greater degree than 
did the Ss in the 4-trial groups. The 
effect of overtraining on initial choice of 
the test stimuli was not apparent until 
the second set of transposition trials, 
when the more complete mastery of the 
initial discrimination did enable these Ss 
to learn more readily the principle of 
response common to the training and the 
first set of test trials. 

Overtraining, then, may be effective 
in increasing the frequency of transposi- 
tion, but in the present experiment its 
effects seem to have resulted from the 
expanded opportunity of Ss to discover 
that similar principles applied in two 
series of stimuli with similar relational 
characteristics. It may be predicted, 
therefore, that overtraining with the 
same set of stimuli would have less effect 
on transposition than if some of the over- 
training were given with a different set 
of stimuli embodying the same principle. 


SUMMARY 


A group of 70 Ss was trained to choose the 
largest of three stimuli and a second group of 70 
Ss was trained to choose the smallest. Half of 
each group in turn were required to reach a cri- 
terion of either four or ten consecutive correct 
responses. ‘Two sets of transposition trials were 
made with stimuli removed one step in either 
direction from the training stimuli. The follow- 
ing results were obtained: 


1. The amount of transposition was less than 
that found in previous studies with younger chil- 
dren trained on two stimuli. 

2. No significant differences in transposition 
behavior were associated with the size of the 
training stimulus. 

3. No significant differences in number of rela- 
tive choices on the first trial of the first transposi- 
tion series were found between the groups dif- 
fering in degree of original training, but signifi- 
cant differences favoring the overtrained groups 
did arise on the first trial of the second set of test 
trials. 

4. In both sets of test trials the overtrained 
Ss chose the correct stimulus in a consistent 
sequence to a greater degree than the less well- 
trained Ss did. 

5. Appropriate verbalization was found to 
be positively related to the ability to show 
transposition. 
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EFFECTS OF NONREINFORCED TRIALS ON 
DISCRIMINATION LEARNING IN 
PRESCHOOL CHILDREN 
GORDON N. CANTOR AND CHARLES C. SPIKER! 

Child Welfare Research Station, State University of Iowa 


Various approaches to discrimina- 
tion learning phenomena, such as that 
represented by Hullian theory (2) 
and the Spence discrimination theory 
(3), maintain that learning occurs 
in a discrimination learning situation 
owing to the nonrewarding of incorrect 
responses, as well as to the rewarding 
of correct responses, provided that an 
initial tendency exists to approach the 
incorrect stimulus. In an investiga- 
tion of discrimination learning, Denny 
and Dunham (1) found that a group 
of rats given four nonreinforced and 
two reinforced trials per day learned 
a discrimination task in a simple T 
maze significantly better than did a 
group given one nonreinforced and 
two reinforced trials per day. The 
present study is designed to investi- 
gate the effect of manipulation of the 
frequency of nonreinforcement on the 
performance of preschool children on a 
forced-choice discrimination learning 
task. 

METHOD 

Subjects —The Ss were 48 children from the 
State University of lowa preschools. There were 
13 three-year-olds, 18 four-year-olds, and 17 five- 
year-olds; 26 were males, and 22 were females. 

Procedure.—Figure 1 shows the main part of 
the apparatus. Tracks were arranged down 
which S could run a wooden railroad car from a 
standard toy set. Two such cars were used; 
these were identical in appearance, except that 
one was painted red, the other green. Each child 


was run individually. The S stood facing the 
side of the tracks with the high ends at his left. 


1The writers are indebted to Mr. Robert 
Seibel for technical assistance. 
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The experiment began with a demonstration 
series in which E rolled each of the cars down one 
of the tracks. Upon contact of a car with the 
dummy switch at the end of a track, E activated 
the marble dispenser by means of a hidden foot 
switch, causing the marble dispenser to roll a 
j-in. steel bearing into a receptacle accessible to 
S’s right hand. The S was led to believe that 
the car’s striking the dummy switch was what 
caused the marble dispenser to operate, and was 
told to place the marbles obtained during the 
demonstration series in a container which lay on 
the table behind the tracks. He was also in- 
formed that the object of the “game” was to fill 
this container with marbles. 

The £ then began presentation of the dis- 


crimination learning task to S. For each S, free 


choice trials were alternated with blocks of forced 
trials. The task began in every case with a free 
choice trial in which both cars were presented on 
the delivery mechanism to S, who was instructed 
to “take just one.” The S chose one of the cars 
and rolled it down either of the tracks. If this 
choice was to be reinforced, E caused the dis- 
penser to roll a marble to S; if the choice was not 
to be reinforced, no marble was delivered. Next, 
S was given his first block of forced trials; these 
consisted in the presentation to S of only one 
stimulus at a time, one color being consistently 
reinforced for the given S, while the other was 
consistently nonreinforced. The alternation of 
single free choice trials with blocks of forced 
trials was continued in exactly this manner until 
S had been given nine free choice trials and eight 
blocks of forced trials. 

The manipulated variable of basic interest in- 
volved the number and kind of forced trials 
given in each of the eight blocks. For half of the 
Ss, two rewarded and two nonrewarded forced 
trials were given in each block, while for the 
other half, two rewarded but only one non- 
rewarded forced trial was given per block. It 
was predicted that the former group would sur- 
pass the latter in number of correct choices made 
on the free choice trials. 

For half of each of the above-described groups, 
all free choice trials were reinforced, regardless of 
correctness or incorrectness of choice, while for 
the other half, all free choices were nonreinforced. 





oe wewoenrenFwer? wm ” 


_ 
— 





DISCRIMINATION LEARNING 257 


DW LE F 





























A~- MARBLE DISPENSER 

B ~ SCREEN 

C- CAR TRACKS 

D~ DUMMY SWITCHES 

F-CAR DELIVERY MECHANISM 
G-LIVE FOOT SWITCH 


Fic. 1. Schematic drawing of apparatus 


REsuLTs AND DiscussIoNn 


Taking into account the two vari- 
ables of amount of nonreinforcement 
during forced trials and presence or 
absence of reinforcement on free 
choice trials, four treatment groups 
are seen to be involved, as follows: 
(I) forced trials reinforced and nonre- 
inforced equally, free choice trials rein- 
forced; (II) forced trials reinforced 
and nonreinforced equally, free choice 
trials nonreinforced ; (III) forced trials 
reinforced and nonreinforced une- 
qually, free choice trials reinforced; 
and (IV) forced trials reinforced and 
nonreinforced unequally, free choice 
trials nonreinforced. The measure of 
interest in the analysis is the number 


TABLE 1 


Frequency or Correct Cuoices on FREE 
Cuoice TRIALS 








Subjects ox 
Mean 


Group 








Months) 
F ( 
Total “Males 


ah Mean| SD 





I 12 9 3 50.42 | 5.83 | 1.57 
II 12 4 8 49.50 | 6.08 | 1.71 
III | 12 7 5 53.42 | 4.17 | 1.57 
IV | 12 6 6 50.33 | 5.08 | 1.38 























TABLE 2 


ANALYsis OF VARIANCE OF FREE 
Cuorce Data 








Source df i. F 





Reinforced vs. nonrein- 
forced free choices 1 4.09 1.54 
2 vs. 1 nonreinforced 














trials per block 1 | 21.34] 8.02* 
Interaction 1 1.32 |}<1.00 
Within cells 44 2.66 
Total 47 

*p = Ol. 


of correct choices made out of the last 
eight of the nine free choice trials. 
The means and sigmas of the four 
treatment groups are shown in Table 
1. 

The basic analysis of the data in- 
volved a 2 X 2 factorial design which 
compared the performance of Ss with 
two nonreinforcements per block of 
forced trials.with that of Ss with only 
one nonreinforcement per block, and 
compared the performance of Ss 
having all free choices reinforced with 
that of Ss having all free choices non- 
reinforced. Table 2 summarizes the 
results of the analysis of variance. 

The interaction is nonsignificant, as 
is the main effect for reinforcement vs. 
nonreinforcement of the free choice 
trials. For the main effect of equal 
vs. unequal reinforcement and nonre- 
inforcement in blocks of forced trials, 
however, the obtained F of 8.02, with 
1 and 44 df, is significant at the .01 
level. That is, the data indicate that 
negative instances forced upon S dur- 
ing training of this nature aid in the 
learning process. The combined mean 
for Ss reinforced and nonreinforced 
equally in forced trials (Groups I and 
IT) is 5.96, while the mean for Ss rein- 
forced and nonreinforced unequally 
(Groups III and IV) is 4.62. It ap- 
parently made no difference in the 
learning whether £ arbitrarily rein- 
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forced or nonreinforced all free choices. 
The combined mean for Ss reinforced 
on all free choice trials (Groups I and 
III) is 5.00, while the mean for Ss non- 


reinforced on free choice trials is 5.58. 


A chi-square analysis of the data for 
the first trial revealed no significant posi- 
tion-color preference in the Ss. A re- 
cording of any remarks made by the Ss 
concerning a color or colors involved in 
the experiment reveals that, of the 48 
Ss, 28 did make some mention of color, 
whereas 20 did not. A comparison of 
these two groups in terms of number of 
correct choices on the last eight free 
choice trials resulted in a ¢ value of 2.02 
which, for 46 df, is significant at the .05 
level. Inspection of the data, however, 
indicates a marked tendency for those Ss 
to mention color who were given as much 
nonreinforcement as reinforcement in the 
training trials. This fact detracts from 
the importance of this comparison of 
color verbalizers and color nonverbalizers, 
since nothing can be said concerning this 
factor independent of the main treatment 
effects. 

The simple analysis of variance com- 
paring the three age groups (3-, 4-, and 
5-year-olds) resulted in an F of less than 
1, indicating no difference in performance 
on the part of these age groups. 

The data agree with the results ob- 
tained by Denny and Dunham (1) con- 


cerning the enhancing effect of negative 
“training” instances on discrimination 
learning and thus extend the generality 
of the findings of these investigators. 
The contention of the Spence discrimi- 
nation learning theory in regard to the 
role of nonreinforcement is clearly sup- 
ported by the present results. 


SUMMARY 


Forty-eight children, ages three to five years, 
were run in a forced choice simple discrimination 
learning situation in which blocks of forced 
choices were alternated with free choice trials. 
Half the Ss received two reinforced and two non- 
reinforced trials in each block of forced trials, 
while the other half received two reinforced but 
only one nonreinforced trial per block. The 
performance of the former group on the free 
choice trials was significantly superior to that of 
the latter group, indicating the importance of 
nonreinforcement in discrimination learning. 
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RESPONSE CONSISTENCY IN PERCEPTION 
AND RETENTION 


EUGENE §. GOLLIN AND ALAN BARON 
Brooklyn College 


Recent experimental studies by 
Witkin (17, 18) and Klein (6) have 
demonstrated the existence of wide 
individual differences in a variety of 
perceptual situations. It was also 
found that individuals tended to 
respond in a consistent fashion in the 
various perceptual situations. This 
consistency of response style, as well as 
the consistency of differences between 
Ss, has permitted Witkin and Klein 
to characterize Ss in terms of percep- 
tual performance. If performance in 
perceptual situations is characteristic 
of individuals, that is, reflective of 
central response tendencies, then it 
should relate to performance in behav- 
ioral situations generally regarded as 
being nonperceptual. This relation- 
ship should be especially evident when 
the representative test situations have 
common organizational properties. 

Gestalt psychologists have likened 
the processes involved in retention to 
those operating in visual perception. 
They have described retroactive inhi- 
bition (RI) situations in terms of 
figure-ground organization and in this 
way have accounted for the superior 
retention of dissimilar elements in 
otherwise homogeneous material. The 
distinctiveness existing between suc- 
cessively learned materials influences 
their later retention in the same way 
that the distinctiveness existing be- 
tween aspects of the visual field influ- 
ences their perception (7). Other in- 
vestigations of the RI phenomenon 
have shown that varying the degree of 
distinctiveness between successively 
learned materials (overlearning, dis- 
similarity, etc.) will facilitate the later 


reproduction of the originally learned 
material (2, 4, 8, 9, 10, 11, 12, 13, 15). 

A perceptual situation which per- 
mits manipulation of distinctiveness 
between aspects of the visual field is 
the Gottschaldt test. Discovery of 
the hidden figure in this test involves 
a discrimination between a simple 
figure with which S has had previous 
experience and the context in which it 
is embedded. Embeddedness of the 
hidden figure can be accomplished in 
varying degrees in much the same way 
that similarity between successively 
learned materials can be manipulated 
in RI situations. 

In both Gottschaldt and RI, S is 
directed to produce material which is 
obscured by experimental manipula- 
tion of stimulus conditions. It has 
been demonstrated that when these 
conditions are held constant for all Ss 
in the Gottschaldt, wide individual 
differences in performance emerge (14, 
18). Individual differences are also 
forthcoming in RI situations. 

In each of these tasks, performance 
is related to the ease with which the 
figural material is maintained in the 
face of hindering field conditions. 
When test conditions are held con- 
stant for all Ss, resulting differences 
in performance may reasonably be 
ascribed to individual differences in 
the facility with which a figure can be 
maintained against a ground. If Ss 
perform consistently in Gottschaldt 
and RI, evidence will be provided for 
the pervasiveness of this facility. 


MeTHOD 


Subjects.—The Ss were 25 male students ran- 
domly selected from introductory psychology 
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classes and assigned to a control group (N = 5) 
and an experimental group (N = 20). 

Materials.—The Gottschaldt test consisted of 
five sets of figures, one of which was used for 
practice. Each set contained a simple inspec- 
tion figure and seven complex figures in which the 
simple figure was embedded. Thus, in addition 
to the practice set there were 28 complex figures. 
The figures in the test were taken from the 
Gottschaldt originals, from those used in the 
experimental psychology laboratory at Brooklyn 
College, and were in part independently con- 
structed. 

In the RI situation two lists of nonsense syl- 
lables were used. There were ten syllables in 
each list and they had Glaze association values 
between 20% and 33% (5). 

Procedure and design—The S was presented 
with the practice Gottschaldt set, and after in- 
specting the simple figure was requested to out- 
line it in pencil on the seven complex figures. In 
the test proper, S inspected the simple figure for 
10 sec. and then attempted to outline it on the 
successively presented complex figures. The Ss 
were timed to the nearest second on each com- 
plex figure. Timing was stopped if S incorrectly 
outlined the simple figure and he was informed of 
his error. The S was allowed to inspect the 
simple figure at any time in the course of the 
testing. During such inspection, timing was also 
stopped. At no time were Ss permitted to see 
the simple and complex figures together. The Ss 
were allowed a maximum of 60 sec. in which to 
solve each complex figure. 

In the RI situation the nonsense syllables were 
presented in an electrically driven circular mem- 
ory drum at a constant rate of speed, each syl- 
lable being exposed for 2 sec. As each syllable 
appeared, S pronounced it aloud. Mispronunci- 
ations were corrected on the first trial. At the 
end of each trial, Ss were given a 4 X 6-in. file 
card and requested to write as many of the syl- 
lables as they could in the same order as pre- 
sented and with the same spelling. The cards 
were removed as soon as S had finished writing. 
This procedure was repeated after each trial 
until S was able to write the list twice consecu- 
tively without error. To compensate for the 
lack of a practice series, learning was preceded 
by a demonstration of the apparatus and exten- 
sive instructions. 

The control group served only in the RI situ- 
ation. Both experimental and control groups 
learned the first list of syllables. Experimental 
Ss then learned the second list while control Ss 
rested. The rest period was presented as a test 
in which control Ss were required to select from a 
book of cartoons the three they liked best. The 
Ss in both groups were then tested for recall by 
having them write as many syllables of the first 


list as they were able to produce. No more than 
1 min. was allowed for this. The Ss were then 
required to relearn the first list to a criterion of 
two perfect trials. 

The tests were administered in counterbal- 
anced order, half the experimental Ss taking the 
Gottschaldt first and half participating in the 
RI situation first. 


RESULTS 


In the RI situation List I (originally 
learned list) and List II (interpolated 
list) were of approximately equal diffi- 
culty. The mean number of trials 
required to learn List I was 11.90 as 
compared with 10.40 for List II 
(¢ = 1.45, p<.20). 

The control and experimental groups 
were adequately matched with regard 
to original learning. A statistical 
comparison of the mean number of 
trials required by each group to reach 
the criterion yielded a t = .83 (p> 
40). 

The interference effect of List II is 
demonstrated by an RI percentage of 
74 obtained by use of the standard 
formula (1), mean recall for the con- 
trol group being 8.4 syllables as com- 
pared with 2.2 syllables for the experi- 
mental group. 

In the Gottschaldt test, mean time 
scores ranged from 5.39 sec. to 29.75 
sec. The mean of the mean time 
scores was 14.59 sec. Although Ss 
differed markedly from one another in 
terms of mean time scores, they re- 
mained fairly self-consistent through- 
out the trials as indicated by a corre- 
lation of .81 between the 14 odd and 
14 even trials. 

In order to test the hypothesized 
pervasiveness of figure-ground facil- 
ity, it was necessary to compare per- 
formance in the Gottschaldt test with 
performances in the learning situa- 
tions. Gottschaldt time scores were 
converted to speed scores by reciprocal 
transformation (3) and rank-order 
correlations were run between Gott- 
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TABLE 1 


Ranx-Orpver CorRELATIONS 


























Sav- | Relearn-} Recall | Original 
Measure ing ing Score | Learning 
Score} Trials Trials 
Gottschaldt 
speed (1/time)| .58* | —.51**| .45**| —.08 
Relearning trials —.74* .28 
Recall score .82* —.08 
*p<.0l 
* > <.05 


schaldt speed and number of trials 
required for relearning, number of 
trials required for original learning, 
number of syllables achieved on the 
first relearning trial (recall score), 
and savings score. Other relevant 
rho’s were also obtained. 

Inspection of Table 1 indicates that 
the magnitude of the correlations 
between Gottschaldt speed and re- 
learning trials, recall scores, and sav- 
ing scores are significantly different 
from zero. These results show that 
the test situations are productive of 
individual differences and that Ss who 
are faster on the Gottschaldt test tend 
to be less susceptible to the interference 
in the RI situation. It should be 
noted that while a relationship signifi- 
cantly different from zero exists be- 
tween Gottschaldt and recall, no such 
finding was obtained between either 
of these and original learning trials. 
Furthermore, the rho between original 
learning trials and relearning trials 
is not significantly different from zero. 
Gottschaldt, recall scores, and relearn- 
ing trials seem to test the stability 
with which previously experienced 
material is maintained under condi- 
tions of interference, while original 
learning trials seem to test the rate of 
establishment of material. 


Discussion 


The results of this study confirm Wit- 
kin’s findings of a wide array of individual 


differences in the Gottschaldt. This find- 
ing, as Witkin points out, “. . . proves 
that it is necessary to consider character- 
istics of the people perceiving the figures 
as well as characteristics of the figures 
themselves. The observation that peo- 
ple tend to be self-consistent in the ease 
or difficulty with which they escape the 
influence of the complex pattern points 
to the importance of the personal factors 
responsible for individual differences” 
(18, p. 13). 

In the retention tests utilized in this 
study marked differences between indi- 
viduals were also found, indicating that 
the number of trials required for relearn- 
ing as well as recall as measured by num- 
ber of syllables produced on the first 
relearning trial depended not only on the 
nature of the learning tasks but also on 
personal factors. 

A consideration of the structural prop- 
erties of the tasks involved in this experi- 
ment suggested that performance was 
related to the facility with which figural 
material could be produced in the face of 
hindering field conditions. 

The fact that Ss who are low on 
relearning trials and high on recall scores 
tend to locate hidden figures with greater 
speed suggests that this facility is per- 
vasive. This demonstration of perva- 
siveness of response style, is, of course, 
confined to the conditions of the experi- 
ment. Moreover, no statement can be 
made regarding the source of the differ- 
ences between individuals. It is felt, 
however, that motivational variables 
may contribute to figure-ground facility. 

The experimental findings suggest that 
ability to make distinctions between 
aspects of given fields may represent a 
useful personological dimension. In this 
regard, the presence of figure-ground rela- 
tionships outside the perceptual area has 
been noted by Woodworth (19). Werner 
has utilized a similar conception in a 
study of proactive inhibition (16). 


SUMMARY 


A retention task (RI situation) and a percep- 
tion task (modified Gottschaldt test) having 
common organizational properties were pre- 
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sented to Ss in order to determine whether they 
would respond consistently in the two test situ- 
ations. Evidence of consistency of response was 
forthcoming. 
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APPARENT SPATIAL ARRANGEMENT AND 
PERCEIVED BRIGHTNESS 


JULIAN E. HOCHBERG 


Cornell University 


The problems of brightness con- 
stancy (e.g., the constancy of per- 
ceived object color under different 
illumination conditions), and of the 
perceptual constancies in general, arise 
from the fact that changed sensory 
stimuli frequently elicit unchanged 
responses (and vice versa) which fol- 
low more closely the variations of dis- 
tal stimuli (objects) than of the 
sensory-surface stimulus distributions. 
This raises difficulties for any formu- 
lation of a one-to-one correspondence 
between stimulus and experience (con- 
fusingly called the “constancy hypoth- 
esis,” [5, p. 86]), which at first sight 
would seem essential to psychological 
prediction. 

Such findings have been used in 
attempted (“nativistic”) refutation of 
the constancy hypothesis and its asso- 
ciated stimulus-sensation units of 
analysis (5), and to demonstrate the 
importance of nonstimulus organiza- 
tional “forces.” Empiricist “‘inferen- 
tial” explanations, on the other hand, 
retain the constancy hypothesis in 
sensation, and ascribe the obtained 
discrepancies to the effects of past 
experience in perception. Objections 
to the nativistic position are: (a) some 
evidence suggests that the accuracy of 
the perceptual constancies depends on 
past experience (1); () no well-defined 
analytic units have been presented to 
supplant the old “sensations,” and in 
their absence precise prediction is diffi- 
cult despite the considerable heuristic 
value of the more or less intuitive 
Gestalt “laws.” General objections to 
the empiricist positions have been: (a) 
it is not possible to distinguish between 
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there is some awkwardness involved 
in the doctrine of “unconscious infer- 
ence” and its derivatives, especially 
when referred to the lower animals in 
which the constancies appear (7, pp. 
605-607); (c) any attempt at precise 
prediction from this viewpoint must 
await as yet unperformed “ecological 
surveys” to determine what the past 
experiences of an organism are likely 
to have been; (d) the constancies also 
appear to exist without opportunity 
for past experience (3), and while 
demonstrated effects of past experience 
on the constancies do not necessarily 
refute the Gestalt position, evidence 
of the reverse seriously injures a thor- 
oughgoing empiricist explanation. 

An alternative formulation is ap- 
pealing: responses may occur in one- 
to-one correspondence not to what we 
had previously taken to be the stimuli 
but to their relationship, without 
regard to central factors, whether of 
association or organization. In con- 
sidering this possibility, we do not 
have to postulate innate knowledge; 
we need only seek new dimensions for 
analyzing the physical stimuli which 
are in correspondence with experience 
(or response). Gestaltists most fre- 
quently sought such invariant relation- 
ship not in the stimulus distribution, 
but in the as yet largely unmeasurable 
psychophysiologically isomorphic cor- 
tical processes; however, one may 
instead direct attention to the reanal- 
ysis of the proximal stimulus pattern 
as do Gibson (2), Helson (4), and 
Wallach (6). 

Thus, Wallach (6) suggests that we 
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Fic. 1. Apparatus for presenting the same target at different apparent slants and illumination 
conditions 


may understand brightness perception 
by taking as the stimulus not the 
intensity of illumination falling on a 
given retinal region, but the relation- 
ship of the intensities of illumination 
falling on adjacent regions. The rela- 
tionship approximated a ratio of inten- 
sities in the situations he studied; i.e., 
the stimuli in the perception of bright- 
ness appeared to be the ratios of illu- 
mination intensities on adjacent areas, 
rather than the illumination inten- 
sities themselves. Thus, if Ss viewed 
a variable disk surrounded by a ring 
of 180 illumination intensity units— 
degrees of episcotister opening—and 
were asked to match the variable disk 
to a disk of 90 units surrounded by a 
ring of 360 units, they set the variable 
disk to a mean value of 47 units, only 
2 units away from the proportionate 
4:1 intensity ratio, which would here 
be 90 + 360 X 180, or 45 units. If 
this redefinition of the stimulus will 
explain all of brightness constancy, we 
can again effect a one-to-one psycho- 
physical formulation of perceived 
brightness. 


The constancy hypothesis was 
shaken since the same absolute stimulus 
intensities aroused different brightness 
responses (and vice versa) ; can condi- 
tions also be found in which the same 
distributions or relationships of stim- 
ulus intensity arouse different bright- 
ness responses? The object of the 
present experiments was to determine 


TABLE 1 


RetativE APPARENT BRIGHTNESSES OF THE 
TARGET WITH DIFFERENT APPARENT 
PostTions AND ILLUMINATION 
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II 15 a 14 0 
by 0 14 
III D 5 a 5 0 
E 5 by 0 5 
F 5 0 0 
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whether a change in the apparent posi- 
tion of a target surface relative to an 
illumination source results in a change 
in the perceived brightness of that 
surface (cf. 7, pp. 600, 612), even 
though the actual illumination condi- 
tions remain constant. 


EXPERIMENT I 


The apparatus! is shown in Fig. 1: the main 
illumination (100 w.) came from above (a), this 
being “indicated” to Ss by the shadow distribu- 
tion on several cubes (d). These “cues” are of 
considerable importance since, in preliminary 
experiments, little or no success was achieved 
without them. The Ss looked monocularly 
through a reduction screen (Sc) at an upright 
cardboard trapezoid target (t), covered with 
Number 8 Hering gray paper, cut so that its 
retinal image would be the same as that of a 
square (Sq) lying flat on the black cloth surface 
(V). All Ss (N = 13) reported seeing a hori- 
zontal square, whose brightness they were then 
(Judgment I) asked to match quickly and unana- 
lytically to a scale (H) of Hering gray paper 
patches (Nos. 1, 3,5, . . . 19). A round rod, r, 
# in. in diameter, 22 in. long, painted white for 
Group A and black for Group B, was then waved 
behind the target (trying to avoid any cast 
shadows visible to S). This was kept up for 
some seconds, as it was difficult not to “see” the 
horizontal square (Sg) instead of the upright 
trapezoid (t); indeed, one S was dropped at this 
point, unable to see the target as upright. The 
Ss again compared the target’s brightness with 
the gray scale (Judgment II). 


The results (Table 1) indicate that 
the target when apparently upright 
is reported as brighter than when 
apparently horizontal. Two questions 
may, however, be asked: First, while 
illumination of the rod is not likely to 
have been responsible for the bright- 
ness change since it was black for some 
Ss and white for others, might not 
inadvertently cast shadows, or even 
the motion itself, have been the impor- 
tant factor? Second, if a stimulus is 
perceived as parallel to the line of 
regard, it should have a greater appar- 
ent area than when perceived as per- 
pendicular to the line of regard (Fig. 
1); may not the lower reported bright- 

1 Modified from one devised by Professor J. J. 


Gibson to study the relationship between per- 
ceived slant and perceived form. 


ness in the former case be due to the 
smaller amount of retinal illumination 
per unit of perceived surface? The 
next two experiments were undertaken 
to test the first question by varying 
the means whereby the apparent shift 
in the target position is brought about, 
and to test the second by changing the 
direction of illumination while, of 
course, holding the apparent size 
change constant. 


EXPERIMENT II 


The procedure of Exp. I was modified here in 
three ways: (a) When attempting to make the 
target appear upright, it was moved through 
short horizontal arcs (R) instead of having a rod 
(r) waved behind it. (b) The 15 Ss (Group C) 
of this experiment ran through the procedure 
with the illumination coming from above (a), 
and then repeated the experiment with the illum- 
ination coming horizontally from in front of the 
trapezoid (b,, Fig. 1), with a concealed supple- 
mentary source (bz) to remove the shadow of the 
target (t) from the cues. (c) The Ss were alter- 
nated in each part of the experiment as to the 
condition to which they were first subjected, the 
upright target or the horizontal square. 

The results (Table 1) under illumi- 
nation from above are the same as 
those in Exp. I: when seen upright, the 
target appears brighter than when 
seen flat. Under illumination from 
in front (b,), the results are the re- 
verse: with horizontal illumination, 
the target appears less bright when 
seen as upright than when flat. Since 
the change in perceived area conse- 
quent upon the change in perceived 
target position would be the same 
both when illumination comes from 
above and from the front, we can 
reject the amount-of-illumination-per- 
perceived-surface-area as a determin- 
ing factor. This suggests that the 
brightness changes are due either 
largely or solely to the perceived 
change in target position with respect 
to the direction of illumination; this is 
supported by the results of the next 
experiment. 


EXPERIMENT III 


Two changes were here made in the pro- 
cedures of Exp. I: (a) The target was changed 
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from flat to upright appearance by shifting from 
monocular vision to binocular vision (through a 
larger hole, ZL). (5) Illumination was maintained 
from above (a) as in Exp. I, for Group D 
(N = 5), from in front horizontally (b;) as in the 
second part of Exp. II, for Group E (N = 5), 
and horizontally from the side (c), for Group F 
(N = 5). 


The results of Groups D and E 
show the same changes in brightness 
as were found in Exp. II. There is no 
evidence of any perceived brightness 
changes in Group F, in which there is 
no change in perceived orientation of 
the target with respect to the illumi- 
nation, since the illumination is paral- 
lel to the target surface in either of the 
two perceived positions. These results 
again suggest that the brightness 
changes are obtained due to the change 
in the relationship of the perceived 
direction of illumination and the per- 
ceived position of the surface it falls 
upon. 

Discussion 


In general, the results of these experi- 
ments suggest that when a surface of a 
given illumination is perceived as being 
perpendicular to the direction of illumi- 
nation, it appears less bright than when 
the same surface, with the same illumi- 
nation, seems parallel to the direction of 
illumination. How does this fit the vari- 
ous approaches to brightness constancy? 

A simple one-to-one correspondence of 
illumination and perceived brightness 
must as usual be rejected, since the same 
stimulus arouses different responses. 
Likewise, any attempt to bring the per- 
ception of brightness into one-to-one cor- 
respondence with illumination ratios is 
inadequate, since differing responses are 
here obtained with the same illumination 
relationships. Either we must view Wal- 
lach’s ratio formulation (or, for that 
matter, Helson’s “‘adaptation level” ex- 
planation) as incomplete, or hold that 
there are at least two different kinds of 
brightness constancy, one bound to the 
illumination conditions and the other not, 
an unparsimonious position. The gen- 
eral viewpoint may, however, be retained 
(as may also a Gestalt organizational one) 


if the determinants of perceived bright- 
ness include not only the peripheral illu- 
mination relationships but the “cues” to 
spatial position and the illumination 
direction (cf. 7, p. 612). The empiricist 
or “inferential” position, disconcertingly 
enough, seems well able to explain the 
findings, at least by hindsight: thus, to 
reflect a given amount of light to the eye, 
a surface parallel to the incident illumi- 
nation would have to havea higher albedo 
or brighter object color than would a sur- 
face perpendicular to the incident illumi- 
nation, and would therefore be “inferred” 
to be brighter. 


SUMMARY 


In order to determine whether perceived 
brightnesses can be brought into one-to-one cor- 
respondence with stimulus illumination relation- 
ships any more than with absolute illumination 
intensities, Ss made judgments of the brightness 
of a target which, under constant or controlled 
conditions of illumination, was made to appear 
to be either perpendicular or parallel to the 
apparent direction of illumination. Since sub- 
stantially the same illumination distributions 
produced different perceived brightnesses, anal- 
yses of brightness constancy in terms of stimulus 
illuminations cannot at present be considered 
complete explanations. 
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THE EFFECTS OF EXPERIMENTALLY INDUCED 
ATTITUDES UPON TASK PROFICIENCY 


R. B. PAYNE AND G. T. HAUTY 
USAF School of Aviation Medicine 


It is conventional to suppose that 
subjective dispositions or biases can 
affect the course of many kinds of 
behavior, presumably through their 
directional and/or dynamic proper- 
ties. This notion is especially preva- 
lent in work-decrement literature, 
which frequently alleges that such fac- 
tors as feelings, emotional tones, and 
task attitudes help determine task 
proficiency. In fact, it is central to 
those concepts of psychological fatigue 
which regard work decrement as symp- 
tomatic of cumulative task aversion 
and failure to maintain proper atti- 
tudes (4, 9). Typically, however, 
such factors are merely inferred as 
intervening variables to account for 
differences in task proficiency. Sel- 
dom have investigators deliberately 
and knowingly induced differential 
dispositions toward a task in order to 
observe their effects upon output. 
Nevertheless, only by this means can 
the role played by such variables be 
finally understood. 

The present study has sought to 
induce differential task dispositions by 
instructional and _ pharmacological 
means in order to observe their effects 
upon the maintenance of proficiency 
in a complicated compensatory pur- 
suit task. While the results provide 
a reasonably lucid answer to the cen- 
tral question, they have the additional 
merit of depicting the nature of cer- 
tain drug effects in independence of 
attitudes, and they reveal something 
about the combinatorial properties of 
dispositions generated by different 
methods. 


PROCEDURE 


Subjects—The Ss were 80 volunteer, experi- 
mentally naive, basic airmen whose ages ranged 
narrowly around 20 yr. 

Task.—Preliminary work indicated that the 
USAF SAM Multidimensional Pursuit Test 
(CM813E), developed by Loucks and described 
by Melton (8), would provide a task appropriate 
to the present study. Performance measures 
obtained thereon are typically quite reliable 
(Spearman-Brown corrected ro, = .85 for N = 
1,800), and, of greater importance, work decre- 
ment becomes manifest after about an hour’s 
practice. In this task S is confronted with a 
panel containing four instruments, the pointers 
of which are made to drift eccentrically and unre- 
latedly about their respective nulls. Each 
pointer is associated electronically with a simu- 
lated aircraft control device (throttle, rudder, or 
stick) so that the drifts can be counteracted. 
Each instrument system includes its own scoring 
relay, which is energized when the pointer is 
within the tolerance limits of the null position. 
The secondary contacts of the relays are serially 
connected with the clutch of an automatic timer 
which sums and stores the time over which the 
four pointers have been kept in their null posi- 
tions concurrently. The entire mechanism is 
under the control of a cycling system which 
meters out alternate work periods (1 min.) and 
rest periods (15 sec.) over any desired span of 
time. 

In the present study the task was broken into 
two series of trials. Following the delivery of 
formal instructions by E, Ss were given 40 trials 
in order to get beyond the early learning incre- 
ments to a substantial level of skill. This was 
followed by a 10-min. interval during which they 
were assigned in a predetermined random order 
to experimental treatments soon to be described. 
After the treatments had been delivered, Ss were 
put back to work on the task for a series of 192 
trials (4 hr.). 

Two copies of the test apparatus under the 
common control of a cycling device were used to 
test Ss in pairs, No S could observe another’s 
performance, nor did he receive information 
about his own progress aside from that gained 
momentarily from the pointers themselves. In- 
struments and controls were well lighted, ambient 
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noise was of low intensity, and temperature was 
held constant. 

Experimental treatments.—Differential task 
dispositions were induced in two ways, (a) by 
instructional methods, and (b) by pharmacolog- 
ical methods. The instructional method was 
based upon the hypothesis that task perform- 
ance is some positive function of the extent to 
which S is made to identify himself with the task 
and to appreciate the purpose and significance of 
what he is about to do. By this method we 
attempted to create two levels of attitude. To 
half of the Ss the following instruction, identified 
hereafter as I,, was given: 

“Now that you have learned to operate the 
apparatus we want to find out how well you can 
perform this task throughout a period of four 
hours. You are encouraged to do the very best 
you can during this time, for you will be giving 
us information that will help us deal with impor- 
tant practical problems.” 

The second instruction, identified hereafter as 
I;, was given to the remaining half of the Ss. 
I, contained I, in principle, but it elaborated the 
theme at length, stressing the similarity between 
the task and real flight, the necessity for alert- 
ness, and the important role played by S in 
helping the Air Force solve one of its most 
important problems. Both instructions were 
delivered in a uniform way by one E. 

The second method of induction took advan- 
tage of the analeptic and depressant properties of 
certain drugs (4, 6, 10, 11, 12, 15, 16). Each 
instructional group was divided into the follow- 
ing five treatment groups: (A) Control (no drug 
at all); (B) Placebo (lactose); (C) Analeptic 1 
(5 mg. dextro-amphetamine sulfate); (D) Ana- 
leptic 2 (20 mg. caffeine derivative) ; (E) Depres- 
sant (0.65 mg. hyoscine hydrobromide mixed 
with 50 mg. diphenhydramine hydrochloride). 
All preparations were delivered in No. 1 pink 
capsules during the 10-min. interval between the 
two series of trials after delivery of the instruc- 
tional treatments. They were introduced with 
the following statement: 

“We will give you a capsule, the effects of 
which we are interested in studying. The only 
information we can give you is that the capsule is 
not the least bit harmful or injurious in any way. 
Are you willing to participate by taking this 
capsule? You may refuse and be excused now 
if you wish.” 

It will be apparent from the foregoing infor- 
mation that a 5 X 2 factorial design was em- 
ployed, in which performance measures were 
taken both before and after the delivery of treat- 
ments. The 80 Ss were distributed equally and 
randomly among the subclasses. 

Questionnaire.—At the conclusion of the work 
period, Ss completed a questionnaire designed 


TABLE 1 


ANALysis OF VARIANCE oF ITEM 
Response VALUES 











Source df — F 

Drugs 4| 4.86 7.03 
Motivations 1 | 2.56 3.70 
Items 4| 2.8537 | 8.49 
DXM 4) 61 

DxI 16| .6834 | 2.03 
M XI 4] .1287 
DXMxI 16 3896 | 1.16 


Between Ss treated alike | 70 6914 
Pooled interaction Ss X 
items 280 | .336 


Total 399 














primarily to elicit information about their covert 
dispositions as they reviewed the experience from 
which they had just emerged. No claim is made, 
of course, for any particular validity of the items 
for this purpose, nor for any particular relia- 
bility. Their outcomes in relation to the experi- 
mental treatments will help decide whether such 
properties can be inferred. The response choices 
for each item were fitted to a simple three-point 
scale, the value 3 being assigned in advance to 
the choice thought to indicate the most favorable 
task disposition, and the value 1, to the other 
extreme. The “disposition” items, choices, and 
assigned choice values were as follows: 

1. Would you volunteer to go through some- 
thing similar to this (a) tomorrow (3), (b) next 
week (2), (c) never (1); 2. How tired do you 
feel? (a) could work longer (3), (b) had enough 
(2), (c) ready to quit earlier (1); 3. Did you get 
bored or lose interest in what you were doing? 
(a) no or somewhat (3), (b) moderately (2), (c) 
very much (1); 4. What effect did the pill have 
upon your performance? (a) helped it (3), (0) 
no effect (2), (c) hindered it (1); 5. Do you think 
this experiment is of use to the Air Force? (a) 
much (3), (b) moderate (2), (c) little (1). Other 
items of a more specific and concrete nature 
required Ss to specify one of three degrees of 
fatigue for each of eight body locations. 


REsuLts AND Discussion 


Subjective effects—A_ preliminary 
question is whether the experimental 
treatments did in fact induce the 
desired differences in task attitudes, 
as operationally defined. An analysis 
of the variance of item response values 
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will help decide. This analysis, shown 
in Table 1, is unnecessarily detailed 
for the problem at hand, but it serves 
our purpose well by showing that (a) 
the variation attributable to drugs is 
highly significant (p<.001), (b) the 
variation attributable to instructional 
groups is reasonably significant (p 
circa .03 when the null hypothesis is 
tested against the alternative Z [I. ] 
> #[1,]), and (c) the interaction of 
main effects is negligible (F<1). In 
order to achieve a more definitive 
analysis of the drug variation, mean 
item response values were computed 
(see Table 2), and ¢ tests of the null 
hypothesis against appropriate one- 
sided alternatives were performed. 
These show that the attitude level 
associated with the dextro-amphet- 
amine treatment was superior to those 
associated with any other pharmaco- 
logical variable (against the alterna- 


tive C>A, p = .001; C>B, p<.001; 


C>D, p = .04; and C>E, p = .001). 
We can be only moderately confident 
of the caffeine derivative effects, de- 
spite the fact that this mean ranked 
in second place (D>A, p = .12; D> 
B, p<.001; and D>E, p =.12). 
Curiously enough, the mean of the 
depressant drug is exceeded only by 
the analeptic means. It is about equal 
to the control mean, but higher than 
the lactose mean (p = .03). The con- 
trol mean is also higher than the lac- 
tose mean (p = .03). 

In summary, we seem to have dem- 
onstrated the beneficial effects of an 
instructional technique (I,) and two 
analeptic drugs upon attitudes, but 
not the negative effects of a depres- 
sant drug. No simple explanation 
capable of embracing all the facts 
seems readily available to account for 
the latter outcome. However, by 
assuming that the variation between 
control, placebo, and depressant treat- 


TABLE 2 





Mean Item Response VALUES 















































Items 
Drug Motive —_ Means 
1 2 3 | 4 5 | 

A I, 2.13 | 2.75 2.63 | 200 | 250 | 2.40 
Is 2.25 | 2.50 23:1 2 | 2. 4-28 

Means 2.19 | 2.63 250 | 200 | 256 | 2:38 

B I; 188 | 1.88 — 4: 4h + 1S 1.98 
I; 2.38 | 1.88 288 | 1.50 | 2.50 2.23 

Means 2.13 | 1.88 2.69 | 1.69 2.13 2.10 

Cc I, 263 | 2.88 2.) t@ | 2 | 2.78 
I, 2.75 2.75 -_.. 1. 28.1. 8: 1-22 

Means 2.69 2.81 294 | 269 | 275 | 2.78 

D Lh | 213 | 238 2.38 | 2.50 2.38 | 2.35 
I, | 225 | 2.75 2.88 | 275 | 288 | 2.70 

Means 2.19 2.56 2.63 2.63 2.63 | 2.53 

E I; 2.38 2.50 2.38 | 1.50 2.50 2.25 
I; 2.50 2.63 2.88 | 2.25 2.25 | 2.50 

Means 2.44 2.56 2.63 1.88 2.38 | 2.38 

Column |, 2.23 2.48 2.58 | 2.10 2.38 2.35 
Means I, 2.43 2.50 2.78 | 2.25 2.60 2.51 
Means 2.33 2.49 2.68 | 218 2.49 2.43 
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TABLE 3 


ANALYsIS OF VARIANCE oF FaticuE RATINGS 











Source df P ond F 

Drugs 4/ .3109 

Motivations 1} 1.1391 1.66 
Loci 7 |21.3372 | 59.15 
DXM 41 .7953 1.16 
DXL 28} .3975 | 1.10 
MxXL 7} .3104 

DXMxXL 28} .3908 |} 1.08 


Between Ss treated alike | 70| .6872 
Pooled interaction Ss X 
loci 490 | .3607 


Total 639 














ments is in fact fortuitous (p =.03 in 
all comparisons), it is possible to view 
the analeptics merely as preservatives 
of initial attitude levels, which, in the 
absence of this preservative, would 
have dropped to minimum measurable 
levels. That we are in fact dealing 
here with diffuse, covert, task-oriented 
dispositions rather than realistically 
perceptible focal discomforts is sup- 
ported by an analysis of Ss’ ratings of 
fatigue intensity for each of eight 
bodily locations. As shown in Table 
3, the only significant variation was 
that between body locations (p <.001), 
experimental treatments to the con- 
trary notwithstanding. These loca- 
tions were ordered in the following 
way with respect to rated fatigue in- 
tensity: arms (least), hands, neck, 
feet, eyes, legs, back, and buttocks 
(most), an order which seems intui- 
tively valid, considering the nature of 
the task. Thus, it would seem that 
the experimental treatments estab- 
lished and maintained differential task 
dispositions despite the fact that 
Ss uniformly became tired in differ- 
ent places by significantly different 
amounts. This will be an important 
fact to recall when performance effects 
are appraised. 

Performance effects —The time-on 
scores made during the 40 pretreat- 


ment trials and the 192 posttreatment 
trials constitute the basic data for the 
statistical appraisal of performance 
effects. In order to eliminate score 
fluctuations due wholly to the eccen- 
tricity of the 8-min. cam pattern in the 
cycling system, the scores for each 
cycle of eight trials were summed for 
each S, thereby producing 5 pretreat- 
ment scores and 24 posttreatment 
scores foreach S. Attention was then 
focused upon the final five cycles as 
the most rigorous test of the hypoth- 
esis under investigation. The mean 
performance levels exhibited during 
this period were converted to loga- 
rithms in order to homogeneate the 
subclass variances, and their regression 
upon pretreatment achievement levels 
was calculated. Following adjustment 
for regression, the treatment variances 
were tested against the estimate of 
error in order to determine their sta- 
tistical significance. The analysis of 
adjusted variances is shown in Table 
4, from which it is clear that very sub- 
stantial differences were produced by 
the pharmacological treatments. By 
contrast, the difference between the 
instructional treatments was negli- 
gible (F <1), and no appreciable sig- 
nificance could be attached to the 
interaction of the main effects. 

A more definitive view of the phar- 
macological variation was obtained by 


TABLE 4 


ANALYsIS OF ADJUSTED VARIANCE OF 
TERMINAL PERFORMANCE 











Source df — F 
Drugs 4 1.6648 33.30** 
Motivations 1 .00004 
DXM 4 0584 1.17 
Error 69* .0500 

Total 78* 











* Reduced by 1 df for regression (rzy = .58; 6 = 
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TABLE 5 
ApyusTep TERMINAL PERFORMANCE MEANS 
Motivations 
Drugs Means 
Ih I: 
A 1.9923 2.0252 2.0088 
B 2.0188 1.9153 1.9671 
c 2.3496 2.3427 2.3462 
D 2.2488 2.1339 2.1914 
E 1.4002 1.5828 1.4915 
Means 2.0019 2.0000 2.0010 














computing the adjusted means and 
testing their differences by the ¢ sta- 
tistic. The means are shown in 
Table 5. Tests of the null hypothesis 
against appropriate one-sided alterna- 
tives permit its rejection for all com- 
parisons except that between control 
and lactose groups (p=.03). This out- 
come can be more fully appreciated by 
reference to Fig. 1, which plots the 
general course of performance through- 


out the posttreatment period for each 
pharmacological group. 

The broad picture is now partially 
clear. The instructional treatments 
induced significantly divergent task 
dispositions, as operationally defined, 
but their assumed dynamic properties 
were not evident in performance, nor 
did comparable analyses of earlier 
cycles reveal them. The analeptic 
treatments likewise allayed the devel- 
opment of unfavorable dispositions, 
but, as expected, they also allayed the 
loss of task proficiency. The ques- 
tion now is whether the performance 
outcome can be accounted for by the 
effects of drugs upon dispositions. 

In view of the instructional out- 
comes, we shall consider the hypoth- 
esis about the effects of attitudes upon 
performance to be impugned if signifi- 
cant drug variation remains after the 
terminal performance scores have been 
adjusted for their multiple regression 
upon initial scores and attitude levels. 
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Fic. 1. Effects of the pharmacological treatments upon task performance 
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The analysis of the adjusted variances 
is given in Table 6, and the adjusted 
means are recorded in Table 7. These 
tables make it clear that very signifi- 
cant drug variation occurs in inde- 
pendence of the attitude variation. 
There is some decrement in the mean 
for dextro-amphetamine, but it remains 
significantly superior to the others. 
It is interesting to note in Table 6 
that attitude scores are uncorrelated 
with pretreatment achievement scores 
(r = .01), but correlated somewhat 
with terminal scores (r = .38, p<.01). 
This constitutes some small bias in 
favor of the working hypothesis, but 
in the absence of confirming evidence 
from the instructional treatments it 
seems best to say that these covariant 
outcomes offer little if any indication 
that task dispositions constitute a link 
between the experimental treatments 
and terminal performance. 


A final word should be said about the 
drug effects themselves. These results, 
although striking, are not novel (4, 6, 15, 
16). They do, however, raise a question 
concerning the way in which the drugs 
achieve their effects upon performance, if 
not through the motivational properties 
of the subjective dispositions they create, 
as some have suggested (1, 2,6). While 
this is largely a matter of speculation at 
the present time, it seems nevertheless 


TABLE 6 


ANALYsIS OF VARIANCE OF TERMINAL PER- 
FORMANCE AFTER ADJUSTMENT FOR 
ATTITUDE AND PRETREATMENT 











LEVELs 
Source df a, F 
Drugs 4 1.3522 | 33.81** 
Motivations 1 0435 1.09 
DXM 4 .0790 1.98 
Error 68* .0400 
Total 77* 














* Reduced by 2 df for multiple regression. Ry.z: = 
69; rez = .O1; Fey = 38; rey = .58 
** > <.001. 


TABLE 7 


TERMINAL PERFORMANCE MEans ADJUSTED FOR 
ATTITUDE AND PRETREATMENT 

















LEVELS 
Motivations 
Drugs Means 
Ih Ie 

A 2.0012 2.0466 2.0239 

B 2.1445 1.9717 2.0581 

. 2.2536 2.2479 2.2508 

D 2.2715 2.0596 2.1656 

E 1.4499 1.5630 1.5065 
Means 2.0241 1.9778 2.0010 














likely that their effects can be viewed in 
terms of their presence in the body fluids 
which bathe or penetrate the nerve cells 
mediating task performance (13, 14). 
Their effects upon cell respiration, with 
implied changes in the efficiency of exci- 
tation and conduction processes, are 
probably of crucial importance. In order 
to view the effects in this way, it seems 
necessary first to postulate some sort of 
change in oxidative metabolism through 
repetitive exercise of the fibers mediating 
task performance (see 5), a postulate 
which is accredited to some extent at the 
behavioral level by the fact that the posi- 
tive influence of analeptic drugs upon 
performance is best demonstrated under 
conditions of task prolongation (3, 17). 
In the case of the amphetamines, depend- 
ing upon the dosage, there is some evi- 
dence that enzymatic activity, carbo- 
hydrate metabolism, basal metabolism, 
and blood flow are increased, and some 
believe that the most logical explanation 
of their cellular effects lies in their capac- 
ity to inhibit the enzyme, amine oxidase, 
which normally alters certain enzymes in 
the brain to form aldehydes that depress 
tissue respiration (6, 7). Thus, they 
may enhance cell respiration, hence pro- 
long the efficiency of excitation and con- 
duction functions, by allaying the forma- 
tion of aldehydes. Similar general syste- 
mic effects have been observed in the case 
of caffeine and its derivatives, but details 
at the cellular level remain obscure. 
Likewise, there is little information about 
the mechanism of action at the cellular 
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level in the case of the depressant prepa- 
ration used. 


SUMMARY 


Eighty Ss received preliminary training on a 
complicated compensatory pursuit task involv- 
ing simulated aircraft instruments and controls, 
then continued work for 4 hr. after having been 
distributed among ten combinations of two moti- 
vational and five pharmacological conditions. 
The treatment effects were appraised in terms of 
subjective dispositions and task performance, 
and the functional connection between these out- 
comes was explored. Both motivational and 
pharmacological treatments induced divergent 
task dispositions. Motivational effects, how- 
ever, were not discernible in performance. Large 
performance effects due to drugs could only 
slightly, if at all, be accounted for by subjective 
effects. While it may be conservative to suppose 
that these results are specific to the conditions of 
the present study, they nevertheless tend to dis- 
credit the general practice of invoking attitudinal 
constructs to explain work-decrement phenom- 
ena in the absence of confirming experimental 
demonstrations. 


REFERENCES 


1. Barmack, J. E. The effect of benzedrine 
sulfate (benzyl methyl carbinamine) upon 
the report of boredom and other factors. 
J. Psychol., 1938, 5, 125-133. 

2. Barack, J. E. The effect of 10 mg. of 
benzedrine sulfate on the Otis test scores 
of college students. Amer. J. Psychiat., 
1940, 97, 163-166. 

3. Barack, J.E. The time of administration 
and some effects of 2 grs. of alkaloid caf- 
feine. J. exp. Psychol., 1940, 27, 690- 
698. 

4. Bartuey, S. H., & Cuute, E. Fatigue and 
impairment in man. New York: Mc- 
Graw-Hill, 1947. 

5. Brink, F., Jn. Excitation and conduction 
in the neuron. In S. S. Stevens (Ed.), 


10. 


11. 


12. 


13. 


14. 
15. 


16. 


17. 


. Goopman, L., & Gitman, A. 


Handbook of experimental psychology. 
New York: Wiley, 1951. Pp. 50-93. 
The pharma- 


cological basis of therapeutics. New York: 
Macmillan, 1941. 
. Krantz, J. C., & Carr, C. J. The pharma- 


cologic principles of medical practice. 
Baltimore: Williams & Wilkins, 1951. 


. Metron, A. W. (Ed.) Apparatus tests. 


Washington: U. S. Government Printing 
Office, 1947. (AAF Aviat. Psychol. Pro- 
gram Res. Rep. No 4. Restricted Supple- 
ment.) 


. Myers, C. S. The study of fatigue. /. 


Pers. Res., 1925, 3, 321-334. 

Payne, R. B., & Hauty,G.T. The effects 
of motion sickness preventives upon cer- 
tain perceptual-motor components of the 
pilot’s task. USAF Sch. Aviat. Med., 
Proj. No. 21-1601-0004, Rep. No. 3. 

Payne, R. B., Moore, E. W., & Betuurum, 
J. L. The effects of certain motion sick- 
ness preventives upon psychological effi- 
ciency. USAF Sch. Aviat. Med., Proj. 
No. 21-32-019, Rep. No. 1. 

Payne, R. B., Oster, D. R., & Tomuinson, 
P. A. S. The effects of certain motion 
sickness preventives upon navigator pro- 
ficiency. USAF Sch. Aviat. Med., Proj. 
No. 21-1601-0004, Rep. No. 1. 

Rosson, J. M., & Keere, C. A. Recent 
advances in pharmacology. Philadelphia: 
Blakiston, 1950. 

Satter, W. T. A textbook of pharmacology. 
Philadelphia: Saunders, 1952. 

Suocx, N. W. Some psychophysiological 
relations. Psychol. Bull., 1939, 36, 447- 
476. 

Spracc, S. D. S. The effects of certain 
drugs on mental and motor efficiency. 
Psychol. Bull., 1941, 38, 354-363. 

Trier, D. B. The effect of amphetamine 
sulfate and some barbiturates on the 
fatigue produced by prolonged wakeful- 
ness. Amer. J. Physiol., 1947, 150, 
253-262. 


(Received July 23, 1953) 





Journal of Experimental Psychology 
Vol. 47, No. 4, 1954 


TRANSFER OF PERCEPTUAL SET! 
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AND DELOS D. WICKENS 
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Much of the research in the field of 
transfer of training has been concerned 
with the manner in which transfer 
varies as a function of the degree of 
similarity between the stimulus-re- 
sponse units of which the two tasks 
are composed, or between the indi- 
vidual stimuli or responses of the 
tasks. Frequently the concepts of 
stimulus and response generalization 
have been employed in theoretical 
analysis, and by so doing the analysis 
is freed to some extent of a dependence 
upon identical elements between the 
two tasks. Transfer then is couched 
in terms of stimulus dimension-—re- 
sponse dimension units rather than 
specific S—R units. 

There are, however, instances of 
transfer which cannot readily be under- 
stood in terms of the relationships 
among stimulus dimension and re- 
sponse dimension units. One example 
of this sort is found in the behavior 
which gave rise to Harlow’s (1) con- 
cept of learning sets, for his monkeys 
showed a generalized ability to solve a 
class of problems even though the par- 
ticular S—-R associations acquired in 
one problem could not be transferred 
to the next. McGeoch and Irion (5) 
have used the term learning-how-to- 
learn to refer to certain laboratory 
experiments which demonstrate the 
same generalized kind of transfer, and 
other examples can be found in work 
on the solution of successive rational 


1 The data for this experiment were collected 
at The Ohio State University under Contract AF 
33 (038)-15474 and have been previously reported 
in AF Technical Report No. 6566. 


learning problems and in transfer in 
puzzle solution. 

In fact, it is probable that in every- 
day life the most considerable portion 
of the significant positive transfer 
which occurs is between tasks for 
which no simple dimensional analysis 
of the stimulus and response similarity 
is possible. In other words, we learn 
how to solve problems much more fre- 
quently than solutions to problems. 

In two recent experiments, Law- 
rence (2, 3) has presented data that 
suggest that rats may, through certain 
kinds of prior training, develop tend- 
encies to disregard certain dimensions 
of sensory data, and this disregard 
may lead to negative transfer if in a 
later task this dimension contains the 
cue stimulus. It was the purpose of 
the present experiment to extend this 
type of transfer investigation into a 
more complicated problem with hu- 
man Ss. Specifically, it is hypothe- 
sized that a training task functions to 
establish perceptual sets, and that, 
consequently, the utilization of a given 
cue or combination of cues in a later, 
similar task will be a function of the 
cue relevance pattern existing in the 
training task. 


MeETHOD 


Apparatus—The apparatus consisted of a 
vertical display panel containing an aperture 
approximately at eye level and a horizontal reac- 
tion panel containing a starting key and four 
reaction keys. A 40-step stepping switch was 
mounted behind the display panel. By mount- 
ing an aluminum disc on the shaft of the stepping 
switch and placing 40 stimuli at proper intervals 
around the perimeter of the disc, it was possible 
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to present a continuous series of stimuli to S. 
Different series of stimuli could be presented 
simply by preparing other discs and changing 
them as desired. 

The apparatus was wired so that there could 
be a fixed connection between a particular stim- 
ulus appearing in the aperture and a particular 
response key. The S’s task was that of learning 
to associate each of the four reaction keys with 
its appropriate stimulus. If the correct reaction 
key was depressed, a new stimulus appeared 
when §S returned his hand to the starting key. 
If an incorrect key was depressed, no new stim- 
ulus appeared in the aperture when S returned 
to the starting key, and he had to continue in 
his attempts to select the correct key. 

Procedure —The stimuli used in this study 
were forms cut from construction paper of vari- 
ous colors. Thus, when S saw a stimulus in the 
aperture he could respond to its form, its color, 
or perhaps to both aspects of the stimulus. For 
each of the tasks used in the experiment, four 
forms and four colors were selected (Fig. 1). 
By arranging the forms on the stimulus disc in 
such a way that each of the four forms had a 
fixed connection with one of the response keys 
and by assigning colors to the forms in a random 
manner, it was possible to make form a relevant 
cue and color an irrelevant cue; i.e., the problem 
could be solved by utilizing the form quality of 
the stimulus, but not by utilizing its color qual- 
ity. In a similar manner color could be made 


























cue 
TRAINING 
TASK 1 
COLORS VIOLET REO ORANGE GREEN 
| EIOOA 
cue 
TRAINING 
TASK 2 
COLORS GREY YELLow TAN BUFF 
GREEN 
STANDARD 
AND 
TEST TASKS 
COLORS PINK YELLOW BLUE BROWN 
CORRECT KEY ‘ 2 4 ‘ 
RESPONSES 














Fic. 1. Forms and colors used in the experi- 
mental tasks and correct key response for each 
stimulus 


relevant and form irrelevant. By pairing color 
and form in a systematic manner and arranging 
the stimuli on the disc properly, it was possible 
to construct a task in which both color and form 
were relevant. Thus, it was possible to give Ss 
three different kinds of cue training. They could 
be trained on (a) a task in which successful per- 
formance was possible only if color cues were 
attended to and form cues neglected, (b) a task 
which demanded that form cues be attended to 
and color cues neglected, or (c) a task which 
could be learned on the basis of either color or 
form or both. 

Three groups of 40 Ss each were used, one 
group for each of the three types of cue training. 
The cue training consisted of learning two sep- 
arate tasks in which the forms and colors used 
were different, but the cue relevance pattern 
remained the same. This was done to accom- 
plish more firmly the purpose of the training 
tasks in establishing cue attention habits and to 
minimize the tendency of Ss to shift from using 
one cue to another merely as a function of going 
from one task to another. Figure 1 shows the 
forms and colors used in the two cue-training 
tasks as well as the particular key correct for each 
stimulus. Each of these tasks was learned to a 
criterion of ten consecutive errorless responses. 

After completing the learning of their respec- 
tive cue-training tasks, all three groups learned 
the same standard task. In this task, both form 
and color were relevant and the forms and colors 
used were different from those used in either of 
the cue-training tasks. Thus, the standard task 
was a completely new task in the sense that Ss 
knew nothing about the stimulus-response asso- 
ciations to be formed. The three groups dif- 
fered, however, in their previous experience with 
regard to what cues are relevant in such tasks. 
All groups learned the standard task to a cri- 
terion of ten consecutive errorless responses. 

Following the preparatory and standard tasks, 
a test task was presented. This consisted of discs 
in which the stimulus objects were either colors 
alone or forms alone; they will be referred to 
hereafter as test task-color and test task-form 
respectively. These tasks were introduced to 
determine whether or not the preliminary train- 
ing had a selective influence on which one of the 
cues available in the standard task was used to 
learn the task. Learning was again to a crite- 
rion of ten consecutive errorless responses. The 
form-relevant, color-absent condition was accom- 
plished by cutting the appropriate forms from 
black construction paper. The color-relevant, 
form-absent condition was accomplished by 
having the appropriately colored construction 
paper cover enough of the disc so that color 
alone appeared in the aperture. The forms or 
the colors used in the two test tasks were the 
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TABLE 1 


ScHEMATIC DESCRIPTION OF THE 
EXPERIMENTAL PROCEDURE 























Condition* 
Task Consonant Dissonant Ambiguous 
a b a b a b 
Training 1 |F,C; |FiC, |FiC, |F,Ci |F-C, | FC 
Training 2 F.C; F,C, F;C, F,C; F.C, Pike 
Standard |F,C, |F,C, |F.C, |F.C, |F,CG | FC 
Test PK, FAG FG IG FAG IBG 























*F = form, C = color, r = relevant,i = irrelevant, 
a = absent. 


same as the forms and colors employed in the 
standard task. In addition, the S-R relation- 
ship between the forms or colors in the two test 
tasks was identical to the S-R relationship for 
the forms and colors in the standard task. 

The rationale for utilizing this procedure is as 
follows. On the standard task all Ss in all 
groups could have solved the problem by using 
either the form dimension, the color dimension, 
or both. If perceptual biasing operates, how- 
ever, it would be expected that Ss would be likely 
to be responsive to one dimension and not to the 
other, or at least less so. In addition, the dimen- 
sion to which they are responsive would, we 
would expect, be determined by the previous 
experience of relevance-irrelevance during the 
two training tasks. One could not determine 
which dimension dominated in the standard task 
from an analysis of this task alone because both 
were available. In the test task only one dimen- 


sion is available to Ss. If this were the same 
dimension that had been employed in the stand- 
ard task, then S’s performance on the test task 
should be better than if he had employed the 
other dimension during the learning of the stand- 
ard task. For these Ss, the test task constitutes 
essentially a recall task. Thus, by varying the 
dimension available in the test task one may 
infer which dimension was utilized in learning 
the standard task. If there were no differences 
among the groups in the learning of the test 
tasks, then, of course, there would be no evidence 
for perceptual biasing. 

In summary, all groups learned the standard 
task with both cues relevant, after having had 
different experience with regard to cue relevance 
in the cue-training tasks. Performance on the 
test tasks, in each case, was used to determine 
what effects the preliminary task experience had 
upon the utilization of cues in learning the stand- 
ard task. 

The experimental procedure is indicated 
schematically in Table 1. The three major 
groups of the experiment are referred to by the 
terms consonant, dissonant, and ambiguous. 
By consonant it is meant that the same stimulus 
dimension which was required for solution in the 
first two tasks was the only dimension available 
in the test task; by dissonant it is meant that 
the dimension on the test task differed from that 
employed on the first two; and by ambiguous it 
is meant that both dimensions were usable in the 
first two tasks. For each major grouping there 
is a counterbalancing for form and color. 

Subjects—The Ss were 120 students from the 
elementary psychology classes at The Ohio State 
University. They were assigned to the various 


experimental conditions in a random manner, 


























TABLE 2 
Mean NumBeEr or RESPONSES TO AND INCLUDING CRITERION FoR Eacu Conpition on Eacu Task 
Condition 
Task Consonant Dissonant Ambiguous 
a b a b a b 
Training 1 M 67.25 65.00 92.60 61.10 44.95 
SD 35.94 39.14 47.32 34.95 24.92 35.50 
Training 2 M 32.10 44.60 54.20 33.65 43.15 30.35 
SD 20.23 39.02 33.81 25.69 27.77 11.56 
Standard M 24.75 20.30 20.75 23.30 30.45 23.60 
SD 12.25 8.07 8.18 11.56 14.61 10.85 
Test M 16.25 12.10 28.00 17.25 24.10 15.85 
SD 6.56 2.51 12.95 5.92 14.25 7.19 
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Male and female Ss were nearly equally dis- 
tributed within each group. 


RESULTS 


Table 2 presents the mean number 
of responses to and including the crite- 
rion for each of the groups on each 
task. A response is defined as the 
occurrence of a single S-R sequence 
whether correct or not. Since the 
criterion responses are included in the 
scores presented in Table 2, ten should 
be subtracted from each score in order 
to find the number of responses pre- 
ceding the first of the sequence of 
criterion responses. 

The data relevant to the hypothesis 
are those that allow comparisons of 
the number of trials to learn the test 
tasks among the various experimental 
groups, i.e., among groups having 
learned the standard task following 
different types of cue training. If dif- 
ferences among such groups exist, we 
have evidence for the fact that a train- 
ing task can function so as to exert an 
influence upon which of the cues avail- 
able in another task situation will be 
utilized by the trainee. 

The last row of Table 2 presents the 
average number of responses required 
by each of the experimental subgroups 
to learn the two test tasks to a crite- 
rion of ten consecutive errorless 
responses. 

It is apparent from an examination 
of this table that the most rapid learn- 
ing of the test task was found in the 
consonant groups and the least rapid 
in the dissonant. Because of inequal- 
ity of variance among the groups, the 
Mann-Whitney U test was employed 
to determine the significance of the 
differences (4). The consonant form 
group differs from the dissonant form 
at a p level of .001, and from the 
ambiguous form at a level of .01; 
while the ambiguous form differs from 
the dissonant at a level of .02. The 


consonant color group differs from the 
dissonant at a level of .001, and from 
the ambiguous at a level of .11; while 
the ambiguous differs from the dis- 
sonant at the .05 level. These are all 
single-tailed tests. 


Discussion 


It may be inferred from the perform- 
ance on the test tasks that the prior expe- 
rience with the relevance and irrelevance 
of certain dimensions on the first two 
tasks developed biases which influenced 
the predominating cue on the third task. 
In this task either dimension—that of 
form or of color—could be employed, but 
the results imply that Ss were more 
responsive to one dimension than another 
and the direction of this responsiveness 
was related to the nature of the pre- 
training. 

A simple stimulus generalization inter- 
pretation of these results would demand 
that there be a high degree of similarity 
of the form (or color) with which a 
response was associated in Tasks 1 and 2, 
and the form (or color) with which that 
response was associated in Task 3. An 
examination of Fig. 1 will indicate that 
no obvious systematic similarities of 
stimulus-response units persisted through- 
out the tasks. It is suggested that in 
tasks of this nature Ss may respond not 
only by pressing a key but may verbalize, 
stating perhaps “color (form) is rele- 
vant,” not “yellow (square) for this 
key.” This ability to name an entire 
dimension, and obtain reinforcement for 
that response, frees S from strict depend- 
ence upon physical or semantic similarity. 
It seems altogether possible that the be- 
havior of subhumans would follow much 
more closely the prediction made from a 
stimulus-response generalization analysis. 
It would also seem probable that human 
Ss working with unfamiliar dimensions 
to which they have learned no inclusive 
symbol would show the more limited type 
of transfer expected from stimulus sim- 
ilarity. Wickens (6) has recently at- 
tempted to sketch out the S-R concepts 
which might be utilized in explaining 
transfer for nonverbalized dimensions, 
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but relatively little systematic work has 
been devoted to analysis of transfer 
which may be dependent upon verbaliza- 
tion of stimulus dimensions. 


SUMMARY 


This experiment was conducted to test the 
influence of perceptual biases produced by pre- 
training upon learning in a later situation. 

The task was a paired-associate type activity 
in which pressing one of four keys was deter- 
mined by a visual stimulus. The visual stimuli 
contained two dimensions, form and color. Each 
S learned four tasks—two pretraining tasks 
which varied from group to group, a common 
task, and a test task in which only one of the two 
stimulus dimensions was present. The common 
task was one in which both dimensions were rele- 
vant to solution, and the question posed by the 
experiment was whether or not the pretraining 
would influence the dimension employed in learn- 
ing this task. The operation of this bias was 
inferred by the performance on the final task in 
which only a single dimension was available to 
Ss. 

For a consonant group only one dimension, 
either form or color, was relevant to solution of 
the two pretraining tasks, and this same dimen- 
sion was the sole dimension present in the test 
task. For a dissonant group, the dimension 
which was relevant in the pretraining was absent 


in the test task. For an ambiguous group both ° 
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dimensions were relevant in the pretraining 
tasks. 

The influence of the pretraining was shown by 
the fact that on the test task the consonant 
group learned with greatest ease, the ambiguous 
was next, and the dissonant was the poorest. 
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JUDGMENT OF TIME AS A FUNCTION OF SERIAL 
POSITION AND STRESS 


JOHN L. FALK! AND DALBIR BINDRA 
McGill University 


In a recent experiment on temporal 
judgment Eson and Kafka (2) noted 
that overestimation of time was 
greater in the initial trials (judgments) 
than in later trials of the same series. 
However, these investigators ,worked 
with only a short series of four trials, 
and did not systematically explore the 
effect of serial position on temporal 
judgment. One of the objects of the 
present experiment was to study this 
effect over a greater range of trials. 

A second aim of this study was to 
investigate the effect of mild stress on 
the judgment of time. Postman and 
Schneider (6) have pointed out that 
for needs, expectancies, etc. to meas- 
urably affect perception, ‘the “stim- 
ulus information” given to S must be 
meager or poorly structured. Thus, 
judgments of time, with the paucity of 
cues on which they are based, and 
their dependence upon memory, should 
provide an ideal means for studying 
the effects of directive or motivational 
factors in perception. A number of 
earlier studies have shown that specific 
motivational factors, such as goal- 
direction and “‘need tension”’ affect the 
estimation of time (4, 5, 7). In the 
present experiment stress was used as 
a motivational variable, and the judg- 
ments of time made while Ss antici- 
pated receiving an electric shock were 
compared with judgments made under 
nonstress conditions. 


MeETHOD 


Subjects —Forty Ss were divided equally 
between an experimental and a control group. 


1 This is a condensed presentation of material 
submitted in partial fulfillment of the require- 
ments for the M.A. degree at McGill University 
(3). This research was supported in part by the 
National Research Council of Canada. 


The Ss were mainly college students, with a mean 
age of 25 yr. and a range of 15 to 36 yr. Each 
group contained 14 males and 6 females. 

Apparatus.—The apparatus consisted of the 
following items, powered by appropriate arrange- 
ments of line-current and dry-cell batteries: (a) 
a chronometer (1/100 sec.) was set in operation 
when S pressed a key, and stopped when S 
released it; (b) a memory drum which exposed 
the various signals to S; (c) an inductorium 
which delivered a mild shock to S via a pair of 
spring clamps over the fingers of his left hand 
(the shock was usually described as somewhat 
painful, and S usually winced and made slight 
withdrawal movements when it was adminis- 
tered); (d) a polygraph which produced a con- 
tinuous record of palmar resistance; (¢) a small 
desk fan, used to mask the sounds coming from 
E’s side of a partitioning screen; and (f) a 
standard doorbeil wired to a push button. 

The production method—The production 
method of temporal judgment was used in the 
present study. In this method, S marks off an 
interval which has been verbally stated by £. 
In our experiment S was asked to mark off 15 sec. 
by pressing and then releasing a key. The 
actual time for which S keeps the key pressed 
will be called Mapsed time (2), and it represents 
the length of time that, in S’s judgment, is equal 
to 15sec. In using the production method, over- 
estimation of time means that S allows less 
chronological time to elapse before he considers 
the stated value as having been reached. Thus, 
the smaller the elapsed time, the greater the 
overestimation. 

Procedure.—The Ss were tested individually. 
Each S was informed that the experiment would 
last about 45 min., and appointments were 
arranged only for those times at which Ss had no 
pressing obligations. If, during the experiment, 
S asked how much longer the experiment would 
last, he was told: “I would rather not say, since 
it is possible that it may affect your results. 
However, you know that the whole experiment 
lasts about 45 min.” Only two or three Ss asked 
for such information. ‘The Ss were given no clue 
as to the accuracy of their time judgments. 

In the experimental group the GSR elec- 
trodes, well smeared with electrode jelly, were 
attached to S’s right hand The index and 


2 Although GSR records were obtained from 
all Ss, their analysis has not been carried out. 
As is well known, meaningful analysis of GSR 
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middle fingers of his left hand were placed in the 
spring clamps which were wired to the induc- 
torium. After noting the basal resistance of S’s 
palm, E came from behind the partitioning screen 
and read the following instructions to S: 

“This is an experiment in time estimation. 
You are to attempt to estimate an interval of 15 
sec. on each trial. You will make your estimate 
of 15 sec. by pressing down this key (E indicates 
the chronometer key to S) and holding it down 
until you think 15 sec. have gone by. Then 
release it. Try pressing the key and holding it 
down, just to get the feel of it. (S presses and 
releases the key a few times.) Just before each 
trial I will say ‘ready’ and then as a signal for 
you to start your estimation you will hear a click 
from this box (£ points to the memory drum), 
and you will see either a red or a blue mark 
appear in this slot (E points to the slot in the 
memory drum). Then you will press the key 
down and keep looking at the mark until you 
think 15 sec. have gone by. Then release the 
key. 

“Try not to shift your position or move about 
much, especially during the trials. At other 
times tell me if you shift, move the chair, or 
yawn. Keep your right arm still, in the position 
most comfortable to you where you can reach 
the key easily. That’s because this (£ points to 
the GSR electrodes) is such a sensitive indicator 
of activity. When you finish each estimate, that 
is, when you take your finger off the key, you 
will either hear a bell ring or feel a slight shock in 
your two fingers. If the mark showing has been 
blue, you will hear a bell ring like this. (£ rings 
the bell.) If the mark showing has been red, you 
will feel a slight shock in your fingers like this. 
(E shocks S.) 

“Remember: First I will say ‘ready.’ Then 
comes a click which means you are to begin. 
Look at the mark which appears in the slot, press 
the key down, and hold it down until you think 
15 sec. have passed. Do not try to count off the 
seconds to yourself. I’m just interested in what 
you feel 15 sec. is like—so don’t count. When 
you feel that 15 sec. have passed, release the key. 
There will be a short rest between trials. Do 
you have any questions? (There were usually 
no questions.) All right. We'll try a few prac- 
tice trials to begin with.” 

The sequence was as described in the instruc- 
tions. The pause between trials (from the occur- 
rence of the bell or shock to the beginning of the 
next trial) was about 25 to 30 sec. 

Five “practice trials” were given during which 
only blue marks appeared, and consequently only 


data is fraught with theoretical as well as prac- 
tical difficulties. However, from a cursory exam- 
ination of the records, a hypothesis connecting 
palmar conductance to temporal judgments in 
any simple manner would appear to be ill- 
founded. 


bells (no shock) occurred when the key was 
released at the conclusion of the judgment. 
When five practice trials were completed S was 
told: “I think you’ve had enough practice now. 
You’ve got the idea. Now we'll begin.” Each 
S was given 20 trials, 10 with blue (bell) signals 
and 10 with red (shock) signals in a predeter- 
mined “random” order. 

When these trials were completed, E told S: 
“The experiment is actually over now, but I’d 
like to do just a few more trials for the sake of 
comparison. Is that all right?” The S invari- 
ably consented, and five more “extra trials” were 
given during which only blue signals appeared. 
Thus there were altogether 30 trials. 

The control group was treated in exactly the 
same way as the experimental group, except that 
no mention of shock was made in the instruc- 
tions, nor was any shock administered. This 
group was told that after a trial with a red signal 
they would “hear a slight twanging sound.” 
The “twanging sound,” which was present for 
the experimental group as well, was produced by 
pressing and subsequently releasing the button 
(which delivered the shock to the experimental 
group) on the inductorium. 


RESULTS 


The 30 judgments (5 practice trials, 
20 experimental trials, and 5 extra 
trials) made by each S were divided 
into six blocks of five judgments each, 
and the mean of each block of five 
successive judgments was calculated. 
These subject means were then aver- 
aged for the 20 Ss. The six serial 
means obtained for each group are 
plotted in Fig. 1, and were found to lie 
in a more or less linear fashion for 
both the experimental and control 
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Fic. 1. Mean elapsed time of intervals 
judged to be 15 sec., as a function of the serial 
order of the judgment 
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TABLE 1 


Duration (tn Seconps) or Jupcep 15-Sec. 
INTERVALS UNDER VARIOUS CoNDITIONS 














Experi- Control 
tal 
Measure Group segues 
Mean| SD | Mean| SD 
Blue signal (bell) 14.9 | 5.0 | 18.8} 12.3 
Red signal (shock for 
exp. group) 14.6 | 4.8 | 19.6} 13.6 
Total (including prac- 
tice and extra trials) | 14.8 | 5.4 | 18.9| 13.4 

















groups. This relationship will here- 
after be referred to as the serial posi- 
tion gradient. 

Since both the serial position gradi- 
ents approximated linearity, the usual 
methods for testing the significance of 
the slope of a line from a slope of zero, 
and for testing the significance of the 
difference between two slopes, were 
used. The regression coefficient, or 
slope of the line, is significantly differ- 
ent from zero for the experimental 
group (p<.01) and for the control 
group (p<.05), but the difference be- 
tween the two regression coefficients is 
not significant (p>.10). The differ- 
ence between the SD’s of the groups 
(see Table 1—Total) is significant 
(p<.001), and the difference between 
the means is significant (p<.01). 

Breaking down the 20 “experimen- 
tal trials,” the means of the blue-signal 
(bell) trials and red-signal (shock for 
experimental group) trials were cal- 
culated. These means are not signifi- 
cantly different for either the experi- 
mental or the control group. 


Discussion 


The serial position gradient.—The 
gradients for both groups confirm the one 
found by Eson and Kafka (2), who also 
used the production method. These Es 
demonstrated a similar, significant serial 
position gradient over the four trials they 
utilized, and concluded that there was, 
in general, overestimation of time in a 
time-estimation—oriented task, the effect 


of serial position being to successively 
decrease the amount of this overestima- 
tion. Eson and Kafka interpreted this 
trend as an improvement in judging the 
interval (15 sec.). Such an interpreta- 
tion is not supported by our results, since 
with an increased number of trials the 
elapsed time (5S’s judgment) passes 
through 15 sec. and continues to increase. 

The work done on the time-order error 
in temporal judgments appears to rule it 
out as an explanation of the serial posi- 
tion gradient. Both Stott (8) and Wood- 
row (9) found that with the comparison 
of two equal intervals, the second one is 
overestimated. Therefore, the conclusion 
may be drawn that if a time-order error 
is operative under the conditions of the 
production method, the serial position 
gradient occurs in spite of it. 

Effect of stress—The experimental 
group was given shock and nonshock 
trials. It was felt that the expectancy of 
pain on the shock trials would induce a 
mild state of anxiety in Ss during the 
judging of those particular intervals. 
Under such uncomfortable  circum- 
stances, it was hypothesized that time 
would be overestimated. The situation 
would parallel the overestimation of time 
typical in states of pain or intense antici- 
pation. However, no significant differ- 
ence was found between the shock and 
the nonshock trials for the experimental 
group. 

It is possible that Ss did not discrimi- 
nate motivationally between the shock 
and nonshock signals, but that they, 
nevertheless, were affected by the shock 
situation. This appears to be a valid 
interpretation in view of the fact that the 
position of the gradient for the experi- 
mental group is displaced downward, i.e., 
toward overestimation, with respect to 
the control group, the mean elapsed time 
for the experimental group being signifi- 
cantly lower than that of the control 
group. Evidently, the knowledge that 
shock was forthcoming did not affect only 
the shock trials, but affected equally all 
the judgments made in the shock situa- 
tion (experimental group). 

The results also show that the differ- 
ence between the two slopes is not statis- 
tically significant. This means that the 
administration of shock, as carried out in 
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this experiment, does not affect the s/ope 
of the serial position gradient. 

The significant downward displace- 
ment of the experimental group’s gradi- 
ent may be interpreted as either (a) a 
shift toward greater accuracy, or (4) a 
shift due to the anticipation of painful 
shock. 

Concerning the first alternative, an 
argument similar to the following could 
be constructed for its support. The 
shock, by making Ss more anxious, may 
have served to make the experimental 
group, on the whole, more alert or “‘vigi- 
lant.” With this higher level of “‘vigi- 
lance” operating, Ss would tend to be 
more accurate; hence the closer approxi- 
mation to 15 sec. achieved by the ex- 


_perimental group (mean = 14.8 sec.). 


Bruner and Postman (1) have proposed 
the construct of “vigilance” to explain 
cases in which increased stress is asso- 
ciated with increased accuracy of per- 
ception. Thus, the smaller deviation of 
the mean of the elapsed times from 15 
sec. in the case of the experimental group 
could be interpreted as indicating a 
“vigilant” attitude resulting in greater 
over-all accuracy of judgment. 

The second alternative makes no 
assumptions concerning the relatively 
greater accuracy of the experimental 
group. The finding that the experimen- 
tal group’s judgments are displaced 
toward overestimation can be looked upon 
as being independent of the notion of 
accuracy, the displacement of the judg- 
ments being attributed solely to the more 
anxious set induced by shock. Such a 
set also means that E has controlled the 
“mental content” of these Ss to a greater 
extent than that of the control Ss, there- 
by reducing the variability (SD). We 
prefer this second alternative: The 
smaller mean of the experimental group 
is not the result of increased accuracy due 
to “vigilance,” but represents a general 
displacement of the judgments toward 
overestimation (relative to the control 
group) owing to a more anxious set in- 
duced by the expectation of shock. 


SUMMARY 


The object of this study was to explore the 
effects of the serial position of judgment and 
mild stress, induced by the threat of impending 
shock, upon judgment of a 15-sec. interval by the 


production method. An experimental group 
(N = 20) received shock at the end of certain 
trials, and a control group (N = 20) never 
received shock. The S was forewarned at the 
beginning of each time interval of the type of 
consequences to be received at the end of the 
interval. 

Statistically significant serial position gradi- 
ents for temporal judgments were found in both 
groups. These gradients were more or less lin- 
ear, and ran from overestimation in the earlier 
trials to underestimation in the later trials. 

The anxiety-pfoducing situation (experimen- 
tal group) led to a significantly greater overesti- 
mation than a comparatively neutral situation, 
but the direction and slope of the serial position 
gradient was not significantly altered by the dif- 
ferences in treatment given the experimental and 
control groups. 

It is argued that the serial position gradient 
does not result either from time-order error, or 
from an improvement in ability to judge time. 
Greater overestimation of the interval by the 
experimental group (relative to the control 
group) is interpreted in terms of an anxious set 
induced by the expectation of shock. 
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SMOOTH EYE MOVEMENTS IN THE ABSENCE OF A 
MOVING VISUAL STIMULUS! 


GERALD WESTHEIMER 
University of Houston 
AND DONALD W. CONOVER 
U. S. Navy Electronics Laboratory 


Conjugate eye movements involving 
cortical areas are generally classified 
(3) under two headings: (a) saccadic 
movements, and (b) pursuit move- 
ments. The two types of movements 
have quite different characteristics 
and are easily distinguishable on pho- 
tographic eye movement records. Pur- 
suit movements are generally very 
smooth and involve relatively slow 
changes in the position of the eye in 
the orbit. Conversely, saccadic move- 
ments of the eyes are rapid and show 
defined changes in velocity during 
their course (5). 

It is generally stated that the stim- 
ulus to pursuit movements must be a 
moving stimulus and that pursuit 
movements cannot be executed in the 
absence of such a stimulus (6). Sup- 
porting evidence for this view is sought 
in pathological conditions in which pa- 
tients with certain neurological lesions 
are said not to be able to move their 
eyes “at will” but can follow a slowly 
moving stimulus object (1). This type 
of evidence has led to the postulation of 
neurological pathways subserving eye 
movements (4) which separate the 
centers and some of the pathways into 
those subserving “voluntary move- 
ments” and those concerned with “‘in- 
voluntary” or “reflex” or “pursuit” 
movements. No clear distinction is 
made in the literature between 
“smooth” and pursuit-type move- 

1 Based on an experiment carried out in the 


Physiological Optics Laboratory of The Ohio 
State University. 


ments, and it is generally stated that 
all eye movements (excepting those of 
labyrinthine or other lower reflex 
origin) are saccadic unless there is a 
moving visual stimulus in the field. 
The purpose of this paper is to report 
the occurrence of smooth eye move- 
ments in the absence of an external 
moving visual stimulus. 


METHOD 


Apparatus.—The recording equipment used 
in this experiment consisted of a modified Amer- 
ican Optical Company “Ophthalmograph” eye 
movement camera. The modifications were 
chiefly in the direction of increasing the resolu- 
tion in the time dimension. By appropriate 
optical devices within the camera, only the hori- 
zontal components of the motions of the two 
eyes were recorded. For a precise time base on 
the film, a beam of light, interrupted 100 times a 
second by a synchronous chopper, was recorded 
along with the corneal reflex from both eyes. 
The Ss were firmly anchored to the camera with 
a full impression of their teeth in a bite bar. 
The only visual stimuli in S’s field of regard, 
other than the lights used to produce the corneal 
reflection, were two fixation-point lights (10’ in 
diameter) separated by 30° at the nodal point of 
the eye projected upon a large screen at a dis- 
tance of 150 cm. from S’s eye. 

Procedure-—The method is similar to those 
employed by Westheimer (5) in his study of eye 
movements in response to visual stimuli in the 
horizontal plane, and by Conover and Stroud (2) 
in their investigation of eye movements in follow- 
ing what they described as imaginary target mo- 
tions. No special preparation is required of S; 
two students with normal ocular motility and 
without previous training in the experimental 
task were used as Ss of this study. 

Trial runs (using other Ss) had shown that 
the experimental task could be carried out in a 
lighted room or in the dark without significant 
effects upon S’s performance. The two fixation 
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Fic. 1. Tracing of photographic record of 
movements of both eyes in response to instruc- 
tions to move eyes back and forth smoothly and 
rapidly between two luminous points 30° apart 
in a horizontal plane 


lights were turned on and S was instructed to 
move his eyes as rapidly and as smoothly as 
possible between the two fixation points. Fif- 
teen seconds of S’s response were recorded each 
session, making a total of 45 sec. of record for 
each of the two Ss. 


REsuULTS 


In both Ss considerable stretches of 
the eye movement records obtained in 
this situation looked like the excerpt 
shown in Fig. 1. The movements 
have all the characteristics of smooth 
or pursuit-type motions of the eye. 
For purposes of comparison, a section 
of a record containing both smooth 
movements and saccadic movements 








TIME 











Fic. 2. Tracing of photographic record of 
movements of both eyes in response to instruc- 
tions to move back and forth smoothly between 
two luminous points 30° apart in a horizontal 
plane. Tracing shows both smooth movements 
and saccadic movements. 


is shown in Fig. 2. The record shown 
in Fig. 1 is, in all respects, identical 
with records obtained of the eyes in 
following a stimulus moving in the 
horizontal direction with velocity and 
acceleration changes dictated by a 
sine function at the maximum rate at 
which these can be carried out, viz., 
about three cycles per second. When 
Ss attempted to move their eyes 
smoothly but slowly from one position 
to the other, no such pursuit-type 
movement was evident. 


SUMMARY 


It was demonstrated that eye movements 
fitting in all respects the conventional descrip- 
tion of smooth or pursuit movements can occur 
in the absence of a moving visual stimulus in a 
situation where there is no reason to expect a 
lower reflex arc to be in operation. It has gen- 
erally been believed that such movements require 
for their initiation the presence of a moving 
visual stimulus. The evidence presented here 
suggests that, at least in the case of the present 
two Ss, the pathways subserving smooth move- 
ments can be activated without a moving visual 
stimulus. 
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